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FIELD OF THE INVENTION 

The present invention relates generally to the fields of genetics and cellular 
and molecular biology. More particularly, the invention relates to novel G protein 
coupled receptors, to polynucleotides that encode such novel receptors, to reagents 
20 such as antibodies, probes, primers and kits comprising such antibodies, probes, 

primers related to the same, and to methods which use the novel G protein coupled 
receptors, polynucleotides or reagents. 

BACKGROUND OF THE INVENTION 

25 The G protein-coupled receptors (GPCRs) form a vast superfamily of cell 

surface receptors which are characterized by an amino-terminal extracellular domain, 
a carboxyl-terminal intracellular domain, and a serpentine structure that passes 
through the cell membrane seven times. Hence, such receptors are sometimes also 
referred to as seven transmembrane (7TM) receptors. These seven transmembrane 

30 domains define three extracellular loops and three intracellular loops, in addition to 
the amino- and carboxy- terminal domains. The extracellular portions of the receptor 
have a role in recognizing and binding one or more extracellular binding partners 
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(e.g., ligands), whereas the intracellular portions have a role in recognizing and 
communicating with downstream molecules in the signal transduction cascade. 

The G protein-coupled receptors bind a variety of ligands including calcium 
ions, hormones, chemokines, neuropeptides, neurotransmitters, nucleotides, lipids, 
odorants, and even photons, and are important in the normal (and sometimes the 
aberrant) function of many cell types. [See generally Strosberg, Eur. J. Biochem. 
196:1-10 (1991) and Bohm et al, Biochem J. 322:1-18 (1997).] When a specific 
ligand binds to its corresponding receptor, the ligand typically stimulates the receptor 
to activate a specific heterotrimeric guanine-nucleotide-binding regulatory protein 
(G-protein) that is coupled to the intracellular portion of the receptor. The G protein 
in turn transmits a signal to an effector molecule within the cell, by either stimulating 
or inhibiting the activity of that effector molecule. These effector molecules include 
adenylate cyclase, phospholipases and ion channels. Adenylate cyclase and 
phospholipases are enzymes that are involved in the production of the second 
messenger molecules cAMP, inositol triphosphate and diacyglycerol. It is through 
this sequence of events that an extracellular ligand stimuli exerts intracellular changes 
through a G protein-coupled receptor. Each such receptor has its own characteristic 
primary structure, expression pattern, ligand-binding profile, and intracellular effector 
system. 

Because of the vital role of G protein-coupled receptors in the communication 
between cells and their environment, such receptors are attractive targets for 
therapeutic intervention, for example by activating or antagonizing such receptors. 
For receptors having a known ligand, the identification of agonists or antagonists may 
be sought specifically to enhance or inhibit the action of the ligand. Some G protein- 
coupled receptors have roles in disease pathogenesis (e.g. , certain chemokine 
receptors that act as HIV co-receptors may have a role in AIDS pathogenesis), and are 
attractive targets for therapeutic intervention even in the absence of knowledge of the 
natural ligand of the receptor. Other receptors are attractive targets for therapeutic 
intervention by virtue of their expression pattern in tissues or cell types that are 
themselves attractive targets for therapeutic intervention. Examples of this latter 
category of receptors include receptors expressed in immune cells, which can be 
targeted to either inhibit autoimmune responses or to enhance immune responses to 
fight pathogens or cancer, and receptors expressed in the brain or other neural organs 
and tissues, which are likely targets in the treatment of schizophrenia, depression, 
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bipolar disease, or other neurological disorders. This latter category of receptor is 
also useful as a marker for identifying and/or purifying (e.g., via fluorescence- 
activated cell sorting) cellular subtypes that express the receptor. Unfortunately, only 
a limited number of G protein receptors from the central nervous system (CNS) are 
known. Thus, a need exists for G protein-coupled receptors that have been identified 
and show promise as targets for therapeutic intervention in a variety of animals, 
including humans. 

SUMMARY OF THE INVENTION 

The present invention relates to an isolated nucleic acid molecule that 
comprises a nucleotide sequence that encodes a polypeptide comprising an amino acid 
sequence homologous to even numbered sequences ranging from SEQ ID NO: 2 to 
SEQ CD NO: 94 and SEQ ID NO: 1 86, or a fragment thereof. The nucleic acid 
molecule encodes at least a portion of nGPCR-x. In some embodiments, the nucleic 
acid molecule comprises a sequence that encodes a polypeptide comprising even 
numbered sequences ranging from SEQ ID NO: 2 to SEQ ID NO: 94 and SEQ ID 
NO: 186, or a fragment thereof. In some embodiments, the nucleic acid molecule 
comprises a sequence homologous to odd numbered sequences ranging from SEQ ID 
NO: 1 to SEQ ID NO: 93 and SEQ ID NO: 185, or a fragment thereof. In some 
embodiments, the nucleic acid molecule comprises a sequence selected from the 
group consisting of odd numbered sequences ranging from SEQ ID NO: 1 to SEQ ID 
NO: 93 and SEQ ID NO; 185, and fragments thereof. 

According to some embodiments, the present invention provides vectors 
which comprise the nucleic acid molecule of the invention. In some embodiments, 
the vector is an expression vector. 

According to some embodiments, the present invention provides host cells 
which comprise the vectors of the invention. In some embodiments, the host cells 
comprise expression vectors. 

The present invention provides an isolated nucleic acid molecule comprising a 
nucleotide sequence complementary to at least a portion of a sequence from an odd 
numbered sequence ranging from SEQ ID NO: 1 to SEQ ID NO: 93 and SEQ ID NO: 
1 85, said portion comprising at least 10 nucleotides. 

The present invention provides a method of producing a polypeptide 
comprising a sequence from an even numbered sequence ranging from SEQ ID NO: 2 
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to SEQ ID NO: 94 and SEQ ID NO: 1 86, or a homolog or fragment thereof. The 
method comprising the steps of introducing a recombinant expression vector that 
includes a nucleotide sequence that encodes the polypeptide into a compatible host 
cell, growing the host cell under conditions for expression of the polypeptide and 
recovering the polypeptide. 

The present invention provides an isolated antibody which binds to an epitope 
on a polypeptide comprising a sequence from an even numbered sequence ranging 
from SEQ ID NO: 2 to SEQ ID NO: 94 and SEQ ID NO: 1 86, or a homolog or 
fragment thereof. 

The present invention provides an method of inducing an immune response in 
a mammal against a polypeptide comprising a sequence from an even numbered 
sequence ranging from SEQ ID NO: 2 to SEQ ID NO: 94 and SEQ ID NO: 186, or a 
homolog or fragment thereof The method comprises administering to a mammal an 
amount of the polypeptide sufficient to induce said immune response. 

The present invention provides a method for identifying a compound which 
binds nGPCR-x. The method comprises the steps of: contacting nGPCR-x with a 
compound and determining whether the compound binds nGPCR-x. 

The present invention provides a method for identifying a compound which 
binds a nucleic acid molecule encoding nGPCR-x, The method comprises the steps of 
contacting said nucleic acid molecule encoding nGPCR-x with a compound and 
determining whether said compound binds said nucleic acid molecule. 

The present invention provides a method for identifying a compound which 
modulates the activity of nGPCR-x. The method comprises the steps of contacting 
nGPCR-x with a compound and determining whether nGPCR-x activity has been 
modulated. 

The present invention provides a method of identifying an animal homolog of 
nGPCR-x. The method comprises the steps screening a nucleic acid database of the 
animal with an odd numbered sequence ranging from SEQ ID NO: 1 to SEQ ID NO: 
93 and SEQ ID NO: 185, or a portion thereof and determining whether a portion of 
said library or database is homologous to said odd numbered sequence ranging from 
SEQ ID NO: 1 to SEQ ID NO: 93 and SEQ ID NO: 1 85, or portion thereof. 

The present invention provides a method of identifying an animal homolog of 
nGPCR-x. The methods comprises the steps screening a nucleic acid library of the 
animal with a nucleic acid molecule having an odd numbered nucleotide sequence 
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ranging from SEQ TD NO: 1 to SEQ rD NO: 93 and SEQ ID NO: 1 85, or a portion 
thereof; and determining whether a portion of said library oY database is homologous 
to said odd numbered nucleotide sequence ranging from SEQ ID NO: 1 to SEQ ID 
NO: 93 and SEQ ID NO: 185, or a portion thereof. 
5 Another aspect of the present invention relates to methods of screening a 

human subject to diagnose a disorder affecting the brain or genetic predisposition 
therefor. The methods comprise the steps of assaying nucleic acid of a human subject 
to determine a presence or an absence of a mutation altering an amino acid sequence, 
expression, or biological activity of at least one nGPCR that is expressed in the brain. 
10 The nGPCR comprise an amino acid sequence selected from the group consisting of: 
SEQ ID NO:74, SEQ ID NO:l 86, SEQ ID NO:78, SEQ ID NO:80, SEQ ID NO:82, 
SEQ ID NO:84, SEQ ID NO:86, SEQ ID NO:90, and SEQ ID NO:94, and allelic 
variants thereof A diagnosis of the disorder or predisposition is made from the 
presence or absence of the mutation. The presence of a mutation altering the amino 
15 acid sequence, expression, or biological activity of the nGPCR in the nucleic acid 
correlates with an increased risk of developing the disorder. 

The present invention further relates to methods of screening for an nGPCR- 
40 or nGPCR-54 hereditary schizophrenia genotype in a human patient. The methods 
comprise the steps of providing a biological sample comprising nucleic acid from the 
20 patient, in which the nucleic acid includes sequences corresponding to allelles of 

nGPCR-40 or nGPCR-54. The presence of one or more mutations in the nGPCR-40 
allelle or the nGPCR-54 allelle is detected indicative of a hereditary schizophrenia 
genotype. 

The present invention provides kits for screening a human subject to diagnose 
25 schizophrenia or a genetic predisposition therefor. The kits include an 

oligonucleotide useful as a probe for identifying polymorphisms in a human nGPCR- 
40 gene or a human nGPCR-54 gene. The oligonucleotide comprises 6-50 
nucleotides in a sequence that is identical or complementary to a sequence of a wild 
type human nGPCR-40 or nGPCR-54 gene sequence or nGPCR-40 or nGPCR-54 
30 coding sequence, except for one sequence difference selected from the group 

consisting of a nucleotide addition, a nucleotide deletion, or nucleotide substitution. 
The kit also includes a media packaged with the oligonucleotide. The media contains 
information for identifying polymorphisms that correlate with schizophrenia or a 
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genetic predisposition therefor, the polymophisms being identifiable using the 
oligonucleotide as a probe. 

The present invention further relates to methods of identifying nGPCR allelic 
variants that correlates with mental disorders. The methods comprise the steps of 
5 providing biological samples that comprise nucleic acid from a human patient 
diagnosed with a mental disorder, or from the patient's genetic progenitors or 
progeny, and detecting in the nucleic acid the presence of one or more mutations in an 
nGPCR that is expressed in the brain. The nGPCR comprises an amino acid sequence 
selected from the group consisting of SEQ ID NO:74, SEQ ID NO: 186, SEQ ID 
10 NO:78, SEQ ID NO:80, SEQ ED NO:82, SEQ ID NO:84, SEQ ID NO:86, SEQ ID 
NO:90, and SEQ ID NO:94, and allelic variants thereof. The nucleic acid includes 
sequences corresponding to the gene or genes encoding nGPCR. The one or more 
mutations detected indicate an allelic variant that correlates with a mental disorder. 
The present invention further relates to purified polynucleotides comprising 

15 nucleotide sequences encoding allelles of nGPCR-40 or nGPCR-54 from a human 
with schizophrenia. The polynucleotide hybridizes to the complement of SEQ ID 
NO:83 or of SEQ ID NO:85 under the following hybridization conditions: (a) 
hybridization for 16 hours at 42 °C in a hybridization solution comprising 50% 
fonnamide, 1% SDS, 1 M NaCl, 10% dextran sulfate and (b) washing 2 times for 30 

20 minutes at 60 °C in a wash solution comprising O.Ix SSC and 1% SDS. The 

polynucleotide that encodes nGPCR-40 or nGPCR-54 amino acid sequence of the 
human differs from SEQ ID NO:84 or SEQ ID NO:86 by at least one residue. 

The present invention also provides methods for identifying a modulator of 
biological activity of nGPCR-40 or nGPCR-54 comprising the steps of contacting a 

25 cell that expresses nGPCR-40 or nGPCR-54 in the presence and in the absence of a 
putative modulator compound and measuring nGPCR-40 or nGPCR-54 biological 
activity in the cell. The decreased or increased nGPCR-40 or nGPCR-54 biological 
activity in the presence versus absence of the putative modulator is indicative of a 
modulator of biological activity. 

30 The present invention further provides methods to identify compounds useful 

for the treatment of schizophrenia. The methods comprise the steps of contacting a 
composition comprising nGPCR-40 with a compound suspected of binding nGPCR- 
40 or contacting a composition comprising nGPCR-54 with a compound suspected of 
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binding nGPCR-54. The binding between nGPCR-40 and the compound suspected of 
binding nGPCR-40 or between nGPCR-54 and the compound suspected of binding 
nGPCR-54 is detected. Compounds identified as binding nGPCR-40 or nGPCR-54 
are candidate compounds useful for the treatment of schizophrenia. 

The present .nvention further provides methods for identifying a compound 
useful as a modulator of binding between nGPCR-40 and a binding partner of 
nGPCR-40 or between nGPCR-54 and a binding partner of nGPCR-54. The methods 
compose the steps of contacting the binding partner and a composition compnsmg 
nGPCR-40 or nGPCR-54 in the presence and in the absence of a putative modulator 
0 compound and detecting binding between the binding partner and nGPCR-40 or 

nGPCR-54 Decreased or increased binding between the binding partner and nGPCR- 
40 or nGPCR-54 in the presence of the putative modulator, as compared to binding in 
the absence of the putative modulator is indicative a modulator compound useful for 
the treatment of schizophrenia. 
, 5 Another aspect of the present invention relates to methods of purifying a G 

protein from a sample containing a G protein. The methods compose the steps of 
contacting the sample with an nGPCR for a time sufficient to allow the G prote.n to 
form a complex with the nGPCR; isolating the complex from remaining components 
of the sample; maintaining the complex under conditions which result in dissociation 
20 of the G protein from the nGPCR; and isolating said G protein from the nGPCR. 

DETAILED DESCRIPTION OF PREFERRED EMBODIMENTS 
Definitions 

Various definitions are made throughout this document. Most words have the 
25 meaning that would be attobuted to those words by one skilled in the art. Words 
specifically defined either below or elsewhere in this document have the meaning 
provided in the context of the present invention as a whole and as are typically 
understood by those skilled in the art. 

"Synthesized" as used herein and understood in the art, refers to 
30 polynucleotides produced by purely chemical, as opposed to enzymatic, methods. 
"Wholly" synthesized DNA sequences are therefore produced entirely by chemical 
means, and "partially" synthesized DNAs embrace those where.n only portions of the 
resulting DNA were produced by chemical means. 
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By the term "region" is meant a physically contiguous portion of the primary 
structure of a biomolecule. In the case of proteins, a region is defined by a contiguous 
portion of the amino acid sequence of that protein. 

The term "domain" is herein defined as referring to a structural part of a 
5 biomolecule that contributes to a known or suspected function of the biomolecule. 
Domains may be co-extensive with regions or portions thereof; domains may also 
incorporate a portion of a biomolecule that is distinct from a particular region, in 
addition to all or part of that region . Examples of GPCR protein domains include, 
but are not limited to, the extracellular (i.e., N-terminal), transmembrane and 

10 cytoplasmic (i.e., C-terminal) domains, which are co-extensive with like-named 

regions of GPCRs; each of the seven transmembrane segments of a GPCR; and each 
of the loop segments (both extracellular and intracellular loops) connecting adjacent 
transmembrane segments. 

As used herein, the term "activity" refers to a variety of measurable indicia 

15 suggesting or revealing binding, either direct or indirect; affecting a response, i.e. 
having a measurable affect in response to some exposure or stimulus, including, for 
example, the affinity of a compound for directly binding a polypeptide or 
polynucleotide of the invention, or, for example, measurement of amounts of 
upstream or downstream proteins or other similar functions after some stimulus or 

20 event. 

Unless indicated otherwise, as used herein, the abbreviation in lower case 
(gpcr) refers to a gene, cDNA, RNA or nucleic acid sequence, while the upper case 
version (GPCR) refers to a protein, polypeptide, peptide, oligopeptide, or amino acid 
sequence. The term "nGPCR-x" refers to any of the nGPCRs taught herein, while 
25 specific reference to a nGPCR (for example nGPCR-5) refers only to that specific 
nGPCR. 

As used herein, the term "antibody" is meant to refer to complete, intact 
antibodies, and Fab, Fab', F(ab)2, and other fragments thereof. Complete, intact 
antibodies include monoclonal antibodies such as murine monoclonal antibodies, 
30 chimeric antibodies and humanized antibodies. 

As used herein, the term "binding" means the physical or chemical interaction 
between two proteins or compounds or associated proteins or compounds or 
combinations thereof. Binding includes ionic, non-ionic, Hydrogen bonds, Van der 
Waals, hydrophobic interactions, etc. The physical interaction, the binding, can be 
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either direct or indirect, indirect being through or due to the effects of another protein 
or compound. Direct binding refers to interactions that do not take place through or 
due to the effect of another protein or compound but instead are without other 
substantial chemical intermediates. Binding may be detected in many different 
5 manners. As a non-limiting example, the physical binding interaction between a 
nGPCR-x of the invention and a compound can be detected using a labeled 
compound. Alternatively, functional evidence of binding can be detected using, for 
example, a cell transfected with and expressing a nGPCR-x of the invention. Binding 
of the transfected cell to a ligand of the nGPCR that was transfected into the cell 
,0 provides functional evidence of binding. Other methods of detecting binding are 
well-known to those of skill in the art. 

As used herein, the term "compound" means any identifiable chemical or 
molecule, including, but not limited to, small molecule, peptide, prote.n, sugar, 
nucleotide, or nucleic acid, and such compound can be natural or synthetic. 
, 5 As used here.n, the term "complementary" refers to Watson-Crick basepairing 

between nucleotide units of a nucleic acid molecule. 

As used herein, the term "contacting" means bringing together, either directly 
or indirectly, a compound into physical proximity to a polypeptide or polynucleotide 
of the invention. The polypeptide or polynucleotide can be in any number of buffers, 
20 salts, solutions etc. Contacting includes, for example, placing the compound into a 
beaker, m.crotiter plate, cell culture flask, or a microarray, such as a gene chip, or the 
like, which contains the nucle.c acid molecule, or polypeptide encoding the nGPCR or 

fragment thereof. 

As used herein, the phrase "homologous nucleotide sequence," or 
25 "homologous amino acid sequence," or variations thereof, refers to sequences 

characterized by a homology, at the nucleotide level or amino ac.d level, of at least 
,he specified percentage. Homologous nucleotide sequences include those sequences 
coding for isoforms of prote.ns. Such .soforms can be expressed in different tissues 
of the same organism as a result of, for example, alternative splicing of RNA. 
30 Alternatively, isoforms can be encoded by different genes. Homologous nucleotide 
sequences include nucleotide sequences encoding for a protein of a species other than 
humans, including, but not limited to, mammals. Homologous nucleotide sequences 
aIso include, but are not limited to, naturally occumng allelic variations and mutations 
of the nucleotide sequences set forth herein. A homologous nucleotide sequence does 
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not, however, include the nucleotide sequence encoding other known GPCRs. 
Homologous amino acid sequences include those amino acid sequences which contain 
conservative amino acid substitutions and which polypeptides have the same binding 
and/or activity. A homologous amino acid sequence does not, however, include the 
5 amino acid sequence encoding other known GPCRs. Percent homology can be 
determined by, for example, the Gap program (Wisconsin Sequence Analysis 
Package, Version 8 for Unix, Genetics Computer Group, University Research Park, 
Madison WI), using the default settings, which uses the algorithm of Smith and 
Waterman (Adv. Appl. Math., 1981, 2, 482-489, which is incorporated herein by 

10 reference in its entirety). 

As used herein, the term "isolated" nucleic acid molecule refers to a nucleic 
acid molecule (DNA or RNA) that has been removed from its native environment. 
Examples of isolated nucleic acid molecules include, but are not limited to, 
recombinant DNA molecules contained in a vector, recombinant DNA molecules 

1 5 maintained in a heterologous host cell, partially or substantially purified nucleic acid 
molecules, and synthetic DNA or RNA molecules. 

As used herein, the terms "modulates" or "modifies" means an increase or 
decrease in the amount, quality, or effect of a particular activity or protein. 

As used herein, the term "oligonucleotide" refers to a series of linked 

20 nucleotide residues which has a sufficient number of bases to be used in a polymerase 
chain reaction (PCR). This short sequence is based on (or designed from) a genomic 
or cDNA sequence and is used to amplify, confirm, or reveal the presence of an 
identical, similar or complementary DNA or RNA in a particular cell or tissue. 
Oligonucleotides comprise portions of a DNA sequence having at least about 10 

25 nucleotides and as many as about 50 nucleotides, preferably about 15 to 30 
nucleotides. They are chemically synthesized and may be used as probes. 

As used herein, the term "probe 1 * refers to nucleic acid sequences of variable 
length, preferably between at least about 10 and as many as about 6,000 nucleotides, 
depending on use. They are used in the detection of identical, similar, or 

30 complementary nucleic acid sequences. Longer length probes are usually obtained 
from a natural or recombinant source, are highly specific and much slower to 
hybridize than oligomers. They may be single- or double-stranded and carefully 
designed to have specificity in PCR, hybridization membrane-based, or ELISA-like 
technologies. 
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The term "preventing" refers to decreasing the probability that an organism 
contracts or develops an abnormal condition. 

The term "treating" refers to having a therapeutic effect and at least partially 
alleviating or abrogating an abnormal condition in the organism. 
5 The term "therapeutic effect" refers to the inhibition or activation factors 

causing or contributing to the abnormal condit.on. A therapeutic effect relieves to 
some extent one or more of the symptoms of the abnormal condition. In reference to 
the treatment of abnormal conditions, a therapeutic effect can refer to one or more of 
the following: (a) an increase in the proliferation, growth, and/or differentiation of 
,0 cells- (b) inhibition (i.e., slowing or stopping) of cell death; (c) inhibition of 

degeneration; (d) relieving to some extent one or more of the symptoms associated 
with the abnorma. condition; and (e) enhancing the function of the affected population 
of cells. Compounds demonstrating efficacy against abnormal conditions can be 
identified as described herein. 
, 5 The term "abnormal condition" refers to a function in the cells or tissues of an 

organism that deviates from their normal functions in that organism. An abnormal 
condition can relate to cell proliferation, cell differentiation, cell signaling, or cell 
survival An abnormal condition may also include obesity, d.abetic complications 
such as retinal degeneration, and irregularities in glucose uptake and metabolism, and 
20 fatty acid uptake and metabolism. 

Abnormal cell proliferative conditions include cancers such as fibrotic and 
mesangial disorders, abnormal angiogenesis and vasculogenes.s, wound healing, 
psoriasis, diabetes mellitus, and inflammation. 

Abnormal differentiation conditions include, but are not limited to, 
25 neurodegenerative disorders, slow wound healing rates, and slow tissue grafting 
healing rates. Abnormal cell signaling conditions include, but are not limited to, 
psychiatnc disorders involving excess neurotransmitter activity. 

Abnormal cell survival conditions may also relate to conditions in which 
programmed cell death (apoptosis) pathways are activated or abrogated. A number of 
30 protein kinases are associated with the apoptosis pathways. Aberrations in the 

funct,on of any one of the protein kinases cou.d lead to cell immortality or premature 
cell death. 

The term "administering" relates to a method of incorporating a compound 
into cells or tissues of an organism. The abnormal condition can be prevented or 
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treated when the cells or tissues of the organism exist within the organism or outside 
of the organism. Cells existing outside the organism can be maintained or grown in 
cell culture dishes. For cells harbored within the organism, many techniques exist in 
the art to administer compounds, including (but not limited to) oral, parenteral, 
5 dermal, injection, and aerosol applications. For cells outside of the organism, 

multiple techniques exist in the art to administer the compounds, including (but not 
limited to) cell microinjection techniques, transformation techniques and carrier 
techniques. 

The abnormal condition can also be prevented or treated by administering a 

10 compound to a group of cells having an aberration in a signal transduction pathway to 
an organism. The effect of administering a compound on organism function can then 
be monitored. The organism is preferably a mouse, rat, rabbit, guinea pig or goat, 
more preferably a monkey or ape, and most preferably a human. 

By "amplification" it is meant increased numbers of DNA or RNA in a cell 

15 compared with normal cells. "Amplification" as it refers to RNA can be the 

detectable presence of RNA in cells, since in some normal cells there is no basal 
expression of RNA. In other normal cells, a basal level of expression exists, therefore 
in these cases amplification is the detection of at least 1 to 2-fold, and preferably 
more, compared to the basal level. 

20 As used herein, the phrase "stringent hybridization conditions" or "stringent 

conditions" refers to conditions under which a probe, primer, or oligonucleotide will 
hybridize to its target sequence, but to no other sequences. Stringent conditions are 
sequence-dependent and will be different in different circumstances. Longer 
sequences hybridize specifically at higher temperatures. Generally, stringent 

25 conditions are selected to be about 5°C lower than the thermal melting point (T m ) for 
the specific sequence at a defined ionic strength and pH. The T m is the temperature 
(under defined ionic strength, pH and nucleic acid concentration) at which 50% of the 
probes complementary to the target sequence hybridize to the target sequence at 
equilibrium. Since the target sequences are generally present in excess, at T m , 50% of 

30 the probes are occupied at equilibrium. Typically, stringent conditions will be those 
in which the salt concentration is less than about 1.0 M sodium ion, typically about 
0.01 to 1.0M sodium ion (or other salts) at pH 7.0 to 8.3 and the temperature is at 
least about 30°C for short probes, primers or oligonucleotides (e.g. 10 to 50 
nucleotides) and at least about 60°C for longer probes, primers or oligonucleotides. 

12 
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Stringent conditions may also be achieved with the addition of destabilizing agents, 

such as formamide. 

The amino acid sequences are presented in the amino to carboxy direction, 
from left to right. The amino and carboxy groups are not presented in the sequence. 
5 The nucleotide sequences are presented by single strand only, in the 5 ' to 3' direction, 
from left to right. Nucleotides and amino acids are represented in the manner 
recommended by the IUPAC-IUB Biochemical Nomenclature Commission or (for 
amino acids) by three letters code. 
Polynucleotides 

10 The present invention provides purified and isolated polynucleotides (e.g. , 

DNA sequences and RNA transcripts, both sense and complementary antisense 
strands, both single- and double-stranded, including splice variants thereof) that 
encode unknown G protein-coupled receptors heretofore termed novel GPCRs, or 
nGPCRs. These genes are described herein and designated herein collectively as 
nGPCR-x (where x is I, 3, 4, 5, 9, 11,12, 14, 15, 18, 16, 17, 20, 21, 22, 24, 27, 28, 
31, 32, 33, 34, 35, 36, 37, 38, 40, 41, 53, 54, 55, 56, 57, 58, 59, or 60). That is, these 
genes are described herein and designated herein as nGPCR-1 (also referred to as 
beGPCR-1), nGPCR-3 (also referred to as beGPCR-3), nGPCR-4 (also referred to as 
beGPCR-4), nGPCR-5 (also referred to as beGPCR-5 and TL-GPCR-5), nGPCR-9 
20 (also referred to as beGPCR-9), nGPCR-1 1 (also referred to as beGPCR-1 1), nGPCR- 
12 (also referred to as beGPCR-12), nGPCR-14 (also referred to as beGPCR-14), 
nGPCR-1 5 (also referred to as beGPCR-1 5), nGPCR-1 8 (also referred to as beGPCR- 
18), nGPCR-16 (also referred to as beGPCR-16), nGPCR-17 (also referred to as 
beGPCR-17), nGPCR-20 (also referred to as beGPCR-20), nGPCR-21 (also referred 
25 to as beGPCR-21), nGPCR-22 (also referred to as beGPCR-22), nGPCR-24 (also 

referred to as beGPCR-24), nGPCR-27 (also referred to as beGPCR-27), nGPCR-28 
(also referred to as beGPCR-28), nGPCR-3 1 (also referred to as beGPCR-3 V), 
nGPCR-32 (also referred to as beGPCR-32), nGPCR-33 (also referred to as beGPCR- 
33), nGPCR-34 (also referred to as beGPCR-34), nGPCR-35 (also referred to as 
beGPCR-35), nGPCR-36 (also referred to as beGPCR-36), nGPCR-37 (also referred 
to as beGPCR-37), nGPCR-38 (also referred to as beGPCR-38), nGPCR-40 (also 
referred to as beGPCR-40), nGPCR-4 1 (also referred to as beGPCR-4 1), nGPCR-53, 
nGPCR-54, nGPCR-55, nGPCR-56, nGPCR-57, nGPCR-58, nGPCR-59, and 
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nGPCR-60. Table 1 below identifies the novel gene sequence nGPCR-x designation, 
the SEQ ID NO: of the gene sequence, the SEQ ID NO: of the polypeptide encoded 
thereby, and the U.S. Provisional Application in which the gene sequence has been 
disclosed. 
Table 1 
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A= Ser. No. 60/165,838 
B= Ser. No. 60/166,071 
C= Ser. No. 60/166,678 
D= Ser. No. 60/173,396 
E= Ser. No. 60/184,129 
F= Ser. No. 60/188,114 
G= Ser. No. 60/185,421 
H= Ser. No 60/186,81 1 



Legend 



1- Ser. No. 60/186,530 
J- Ser. No. 60/207,094 
JC= Ser. No. 60/203,11 1 
L= Ser. No. 60/190,310 
M= Ser. No. 60/201,190 
N= Ser. No. 60/185554 
0= Ser. No. 60/190,800 
P= Ser. No. 60/198,568 



20 



When a specific nGPCR is identified (for example nGPCR-5), it is understood 
that only that specific nGPCR is being referred to. 

As described in Example 4 below, the genes encoding nGPCR-1 (nucleic acid 
sequence SEQ ID NO: 1, SEQ ID NO: 73, amino acid sequence SEQ ID NO: 2, SEQ 
ID NO:74), nGPCR-9 (nucleic acid sequence SEQ ID NO:9, SEQ ID NO:77, amino 
acid sequence SEQ ID NO: 10, SEQ ID NO:78), nGPCR-1 1 (nucleic acid sequence 
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SEQ ID NO:l 1, SEQ ID NO:79, amino acid sequence SEQ ID NO: 12, SEQ ID 
NO:80), nGPCR-16 (nucleic acid sequence SEQ ID NO: 21, SEQ ID NO:81, amino 
acid sequence SEQ ID NO: 22, SEQ ID NO:82), nGPCR-40 (nucleic acid sequence 
SEQ ID NO:53, SEQ ID NO:83, amino acid sequence SEQ ID NO:54, SEQ ID 

5 NO:84), nGPCR-54 (nucleic acid sequence SEQ ID NO:59, SEQ ID NO:85, ammo 
acid sequence SEQ ID NO:60, SEQ ID NO: 86), nGPCR-56 (nucleic acid sequence 
SEQ ID NO:63, SEQ ID NO:87, SEQ ID NO:89, amino acid sequence SEQ ID 
NO:64, SEQ ID NO: 88, SEQ ID NO:90), nGPCR-58 (nucleic acid sequence SEQ ID 
NO:67, SEQ ID NO:91, SEQ ID NO:93, amino acid sequence SEQ ID NO:68, SEQ 

10 ID NO: 92, SEQ ID NO:94) and nGPCR-3 (nucleic acid sequence SEQ ID NO:3, 
SEQ ID NO: 185, amino acid sequence SEQ ID N0:4, SEQ ID NO: 186) have been 
detected in brain tissue indicating that these n-GPCR-x proteins are neuroreceptors. 

The invention provides purified and isolated polynucleotides (e.g., cDNA, 
genomic DNA, synthetic DNA, RNA, or combinations thereof, whether single- or 

15 double-stranded) that comprise a nucleotide sequence encoding the amino acid 

sequence of the polypeptides of the invention. Such polynucleotides are useful for 
recombinantly expressing the receptor and also for detecting expression of the 
receptor in cells (e.g., using Northern hybridization and in situ hybridization assays). 
Such polynucleotides also are useful in the design of antisense and other molecules 

20 for the suppression of the expression of nGPCR-x in a cultured cell, a tissue, or an 
animal; for therapeutic purposes; or to provide a model for diseases or conditions 
characterized by aberrant nGPCR-x expression. Specifically excluded from the 
definition of polynucleotides of the invention are entire isolated, non-recombinant 
native chromosomes of host cells. A preferred polynucleotide has the sequence of the 

25 sequence set forth in odd numbered sequences ranging from SEQ ID NO: 1 to SEQ 
ID NO: 93 and SEQ ID NO: 1 85, which correspond to naturally occurring nGPCR-x 
sequences. It will be appreciated that numerous other polynucleotide sequences exist 
that also encode nGPCR-x having the sequence set forth in even numbered sequences 
ranging from SEQ ID NO: 2 to SEQ ID NO: 94 and SEQ ID NO: 1 86, due to the 

30 well-known degeneracy of the universal genetic code. 

The invention also provides a purified and isolated polynucleotide comprising 
a nucleotide sequence that encodes a mammalian polypeptide, wherein the 
polynucleotide hybridizes to a polynucleotide having the sequence set forth in odd 
numbered sequences ranging from SEQ ID NO: 1 to SEQ ID NO: 93 and SEQ ID 
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NO: 185 or the non-coding strand complementary thereto, under the following 
hybridization conditions: 

(a) hybridization for 16 hours at 42 U C in a hybridization solution comprising 
50% formamide, 1% SDS, 1 M NaCl, 10% dextran sulfate; and 
5 (b) washing 2 times for 30 minutes each at 60°C in a wash solution comprising 

0.1% SSC, 1% SDS. Polynucleotides that encode a human allelic variant are highly 
preferred. 

The present invention relates to molecules which comprise the gene sequences 
that encode the nGPCRs; constructs and recombinant host cells incorporating the gene 
10 sequences; the novel GPCR polypeptides encoded by the gene sequences; antibodies 
to the polypeptides and homologs; kits employing the polynucleotides and 
polypeptides, and methods of making and using all of the foregoing. In addition, the 
present invention relates to homologs of the gene sequences and of the polypeptides 
and methods of making and using the same. 

15 Genomic DNA of the invention comprises the protein-coding region for a 

polypeptide of the invention and is also intended to include allelic variants thereof Tt 
is widely understood that, for many genes, genomic DNA is transcribed into RNA 
transcripts that undergo one or more splicing events wherein intron (i.e., non-coding 
regions) of the transcripts are removed, or "spliced out." RNA transcripts that can be 

20 spliced by alternative mechanisms, and therefore be subject to removal of different 
RNA sequences but still encode a nGPCR-x polypeptide, are referred to in the art as 
splice variants which are embraced by the invention. Splice variants comprehended 
by the invention therefore are encoded by the same original genomic DNA sequences 
but arise from distinct mRNA transcripts. Allelic variants are modified forms of a 

25 wild-type gene sequence, the modification resulting from recombination during 
chromosomal segregation or exposure to conditions which give rise to genetic 
mutation. Allelic variants, like wild type genes, are naturally occurring sequences (as 
opposed to non-naturally occurring variants that arise from in vitro manipulation). 
The invention also comprehends cDNA that is obtained through reverse 

30 transcription of an RNA polynucleotide encoding nGPCR-x (conventionally followed 
by second strand synthesis of a complementary strand to provide a double-stranded 
DNA). 
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Preferred DNA sequences encoding human nGPCR-x polypeptides are set out 

c ccn rn MO 1 to SEO ID NO: 93 and SEQ 
in odd numbered sequences rang.ng from SEQ ID NO. 1 to ity 

ID NO- 185 A preferred DNA of the invention comprises a double stranded 

molecule along with the commentary molecule (the "non-coding strand" or 

-complemenO having a sequence unambiguously deducb.e from the coding strand 

accord,ng to Watson-Cnck base-painng ru.es for DNA. A.so preferred are other 

polynucleotides encod.ng the nOPCR-x po.ypept.de of even numbered sequences 

f nm SEO ID NO- 2 to SEQ ID MO: 94 and SEQ ID NO: 186, which differ in 

ranging from bbg iu inw. ^ o v n 

sequence from the polynucleotides of odd numbered sequences ranging from SEQ ID 
NO: 1 to SEQ ID NO: 93 and SEQ ID NO: 185, by v.rtue of the well-known 
degeneracy of the universal nuclear genetic code. 

The invention further embraces other species, preferably mammalian, 
Homologs of the human nGPCR-x DNA. Specie, homologs, sometimes referred to as 
-orthologs " in genera., share at least 35%, at least 40%, at least 45%, at least 50* at 
, least 60%,, .east 65%, at .east 70%. at .east 75%. a. least 80%, at .east 85%, at .east 
90% a, .east 95%, at least 98%, or at least 99% homo.ogy with human DNA of the 
invention. Generally, percent sequence "homology" with respect to polynucleotides 
of the invention may be calculated as the percentage of nucleotide bases ,n the 
ca nd,da,e sequence that are ident.ca. to nucleotides in the nGPCR-x sequence set 
0 forth in odd numbered sequences ranging from SEQ ID NO: 1 to SEQ ID NO: 93 and 
SEQ ID NO: 185, after aligning the sequences and mtroducng gaps, if necessary, to 
achieve the maximum percent sequence identity. 

Polynucleotides of the invention permit identification and .solation of 
pol vnuc.eotides encoding related nGPCR-x po.ypept.des, such as human allelic 
25 varian ts and species homologs, by well-known techniques including Southern and/or 
Northern hybridization, and polymerase chain reaction (PGR). Examples of relate 
pdynucleotides include human and non-human genomic sequences, including allelic 
variants as well as polynucleotides encoding polypeptides homologous to nGPCR-x 
and structurally re.ated polypeptides sharing one or more biological, immunological, 
30 and/or physical propert.es of nGPCR-x. Non-human species genes encoding proteins 
Homologous to nGPCR-x can a.so be identified by Southern and/or PCR analys.s and 
are useful in animal models for nGPCR-x disorders. Knowledge of the sequence of a 
human nGPCR-x DNA a.so makes poss.b.e through use of Southern hybridization or 
polymerase chain reaction (PCR) the identification of genomic DNA sequences 



17 



WO \\ 1/36473 



PCT/LJS00/315S1 



encoding nGPCR-x expression control regulatory sequences such as promoters, 
operators, enhancers, repressors, and the like. Polynucleotides of the invention are 
also useful in hybridization assays to detect the capacity of cells to express nGPCR-x. 
Polynucleotides of the invention may also provide a basis for diagnostic methods 
useful for identifying a genetic alteration(s) in a nGPCR-x locus that underlies a 
disease state or states, which information is useful both for diagnosis and for selection 
of therapeutic strategies. 

According to the present invention, the nGPCR-x nucleotide sequences 
disclosed herein may be used to identify homologs of the nGPCR-x, in other animals, 
including but not limited to humans and other mammals, and invertebrates. Any of 
the nucleotide sequences disclosed herein, or any portion thereof, can be used, for 
example, as probes to screen databases or nucleic acid libraries, such as, for example, 
genomic or cDNA libraries, to identify homologs, using screening procedures well 
known to those skilled in the art. Accordingly, homologs having at least 50%, more 
preferably at least 60%, more preferably at least 70%, more preferably at least 80%, 
more preferably at least 90%, more preferably at least 95%, and most preferably at 
least 100% homology with nGPCR-x sequences can be identified. 

The disclosure herein of full-length polynucleotides encoding nGPCR-x 
polypeptides makes readily available to the worker of ordinary skill in the art every 
possible fragment of the full-length polynucleotide. 

One preferred embodiment of the present invention provides an isolated 
nucleic acid molecule comprising a sequence homologous to odd numbered sequences 
selected from the group consisting of SEQ ID NO: 1 to SEQ ID NO:93, SEQ ID NO: 
185, and fragments thereof. Another preferred embodiment provides an isolated 
nucleic acid molecule comprising a sequence selected from the group of odd 
numbered sequences consisting of SEQ ID NO: 1 to SEQ ID NO: 93, SEQ ID NO: 
185 and fragments thereof. 

As used in the present invention, fragments of nGPCR-x-encoding 
polynucleotides comprise at least 10, and preferably at least 12, 14, 16, 18, 20, 25, 50, 
or 75 consecutive nucleotides of a polynucleotide encoding nGPCR-x. Preferably, 
fragment polynucleotides of the invention comprise sequences unique to the nGPCR- 
x-encoding polynucleotide sequence, and therefore hybridize under highly stringent or 
moderately stringent conditions only (i.e., "specifically") to polynucleotides encoding 
nGPCR-x (or fragments thereof). Polynucleotide fragments of genomic sequences of 
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the mvent.cn compnse not only sequences unique to the coding region, but also 
include fragments of the full-length sequence derived from .nitons, regulatory 
reoions and/or other non-trans.ated sequences. Sequences unique to po.ynucleotides 
offce invention are recognizable through sequence comparison to other known 
5 polynucleotides, and can be identified through use of alignment programs routinely 
utilized in the art, e.g., those made available .n public sequence databases. Such 
sequences also are recognizable from Southern hybridization analyses to determine 
the number of fragments of genomic DNA to which a polynucleotide will hybnd.ze. 
Polynucleotides of the invention can be labeled in a manner that permits the.r 
,0 detection, including rad.oact.ve, fluorescent, and enzymatic labeling. 

Fragment polynucleotides are particularly useful as probes for detection of 
full-length or fragments of nGPCR-x polynucleotides. One or more polynucleotides 
can be included in kits that are used to detect the presence of a polynucleot.de 
encoding nGPCR-x, or used to detect variations in a polynucleotide sequence 

15 encoding nGPCR-x. 

The invention also embraces DN As encoding nGPCR-x polypeptides that 
hybrids under moderately stringent or high stringency conditions to the non-coding 
strand or complement, of the polynucleotides set forth in odd numbered sequences 
ranging from SEQ ID NO: 1 to SEQ ID NO: 93 and SEQ ID NO: 185. 
20 Exemplary highly stringent hybrid.zat.on conditions are as follows: 

hybnd.zat.on at 42°C in a hybridization solution compris.ng 50% formam.de, 1% 
SDS 1 M NaCl, 10% Dextran sulfate, and washing tw.ee for 30 minutes at 60°C m a 
wash solution comprising 0.1 X SSC and 1% SDS. I. is understood in the art that 
conditions of equivalent stringency can be achieved through variation of temperature 
25 and buffer, or salt concentrate as described Ausubel et al. (Eds), P^otocplsm 

MolecuJar_BipJogy., John W.ley & Sons (1994), pp. 6.0.3 to 6.4.10. Modifications ,n 
hyb „d.zat,on conditions can be empirically determined or precisely calculated based 
on the length and the percentage of guanosine/cytos.ne (GC) base pa.nng of the 
probe The hybridization conditions can be calculated as described in Sambrook, et 
30 fl ,, (Ed ,), MoIecuJarOoj^^ Cold Spring Harbor Laboratory 

Press- Cold Spnng Harbor, New York (1989), pp. 9.47 to 9.51. 

With the knowledge of the nucleot.de sequence information disclosed in the 
present invention, one skilled in the art can identify and obtain nucleot.de sequences 
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which encode nGPCR-x from different sources (i.e., different tissues or different 
organisms) through a variety of means well known to the skilled artisan and as 
disclosed by, for example, Sambrook et al., "Molecular cloning: a laboratory manual", 
Second Edition, Cold Spring Harbor Press, Cold Spring Harbor, NY (1989), which is 
5 incorporated herein by reference in its entirety. 

For example, DNA that encodes nGPCR-x may be obtained by screening of 
mRNA, cDNA, or genomic DNA with oligonucleotide probes generated from the 
nGPCR-x gene sequence information provided herein. Probes may be labeled with a 
detectable group, such as a fluorescent group, a radioactive atom or a 
10 chemi luminescent group in accordance with procedures known to the skilled artisan 
and used in conventional hybridization assays, as described by, for example, 
Sambrook et al. 

A nucleic acid molecule comprising any of the nGPCR-x nucleotide sequences 
described above can alternatively be synthesized by use of the polymerase chain 

15 reaction (PCR) procedure, with the PCR oligonucleotide primers produced from the 
nucleotide sequences provided herein. See U.S. Patent Numbers 4,683,195 to Mullis 
et al. and 4,683,202 to Mullis. The PCR reaction provides a method for selectively 
increasing the concentration of a particular nucleic acid sequence even when that 
sequence has not been previously purified and is present only in a single copy in a 

20 particular sample. The method can be used to amplify either single- or double- 
stranded DNA. The essence of the method involves the use of two oligonucleotide 
probes to serve as primers for the template-dependent, polymerase mediated 
replication of a desired nucleic acid molecule. 

A wide variety of alternative cloning and in vitro amplification methodologies 

25 are well known to those skilled in the art. Examples of these techniques are found in, 
for example, Berger et al., Guide to Molecular Cloning Techniques, Methods in 
Enzymology 152, Academic Press, Inc., San Diego, CA (Berger), which is 
incorporated herein by reference in its entirety. 

Automated sequencing methods can be used to obtain or verify the nucleotide 

30 sequence of nGPCR-x. The nGPCR-x nucleotide sequences of the present invention 
are believed to be 100% accurate. However, as is known in the art, nucleotide 
sequence obtained by automated methods may contain some errors. Nucleotide 
sequences determined by automation are typically at least about 90%, more typically 
at least about 95% to at least about 99.9% identical to the actual nucleotide sequence 
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of a given nuc.eic acid molecule. The actua, sequence may be more precisely 
determined us,ng manual sequencing methods, which are well known in the art. An 
error in a sequence which results in an insertion or deletion of one or more 
nuc ,eot,des may result in a frame shift in transition such that the predicted ammo 
acid sequence will differ from that wh.ch would be predicted from the actua. 

The nucleic acid molecules of the present invention, and fragments denved 
therefrom are useful for screening for restriction fragment length polymorphism 
(RFLP) associated with certain disorders, as we., as for genetic mappmg. 

The polynucleotide sequence information provided by the invention makes 
possible ,arge-sca,e expression of the encoded polypeptide by techniques we., known 
and routinely practiced in the art. 

VeC,0, " S Another aspect of the present invention ,s directed to vectors, or recombinant 
expression vectors, composing any of the nuc.eic acid mo.ecu.es described above. 
Vectors are used herein either to amphfy DNA or R^A encoding nGPCR-x an^or o 
express DNA wh.ch encodes nGPCR-x. Preferred vectors include, but are not hrmted 
to p,asm,ds, phages, cosmids, ep.somes, v.a, particle, or viruses, and .ntegratab e 
DNA fragments („., fragments .ntegratable into the host genome by homologous 
combination). Preferred v.ra, panicles include, but are not hm.ted to, adenoviruses, 
baculcruses, parvoviruses, herpesviruses, poxviruses, adeno-associated viruses, 
Semhk, Forest viruses, vaccinia viruses, and retroviruses. Preferred expression 
vectors,nc,ude,butarenot,.m,t^^ 

Biotech). Other expression vectors include, but are not limited to, pSPOR^ 
vectors pGEM™ vectors (Promega), pPROEXvectors™ (LT1. Bethesda, MD), 
BluescHpt™ vectors (Stratagene), pQE™ vectors (Q.agen), P SE420™ (.nv.trogen), 

Expression constructs preferab.y comprise GPCR-x-encod,n g polynucleotides 
operatives UnKed to an endogenous or exogenous expression control DNA sequence 
Id a transition termmato, Expression control DNA sequences tnclud pr— 
enhancers, operators, and regu.atory e.ement b.nd.ng sites general.y, and a e t^ica.ly 
sel ected based on the expression systems in which the expression construct » to be 
utilized. Preferred promoter and enhancer sequences are generally selec ed for 
ability to increase gene expression, while operator sequences are generally selected 
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for the ability to regulate gene expression. Expression constructs of the invention 
may also include sequences encoding one or more selectable markers that permit 
identification of host cells bearing the construct. Expression constructs may also 
include sequences that facilitate, and preferably promote, homologous recombination 
5 in a host cell. Preferred constructs of the invention also include sequences necessary 
for replication in a host cell. 

Expression constructs are preferably utilized for production of an encoded 
protein, but may also be utilized simply to amplify a nGPCR-x-encoding 
polynucleotide sequence. In preferred embodiments, the vector is an expression 

10 vector wherein the polynucleotide of the invention is operatively linked to a 
polynucleotide comprising an expression control sequence. Autonomously 
replicating recombinant expression constructs such as plasmid and viral DNA vectors 
incorporating polynucleotides of the invention are also provided. Preferred 
expression vectors are replicable DNA constructs in which a DNA sequence encoding 

15 nGPCR-x is operably linked or connected to suitable control sequences capable of 
effecting the expression of the nGPCR-x in a suitable host. DNA regions are 
operably linked or connected when they are functionally related to each other. For 
example, a promoter is operably linked or connected to a coding sequence if it 
controls the transcription of the sequence. Amplification vectors do not require 

20 expression control domains, but rather need only the ability to replicate in a host, 
usually conferred by an origin of replication, and a selection gene to facilitate 
recognition of trans form ants. The need for control sequences in the expression vector 
will vary depending upon the host selected and the transformation method chosen. 
Generally, control sequences include a transcriptional promoter, an optional operator 

25 sequence to control transcription, a sequence encoding suitable mRNA ribosomal 

binding and sequences which control the termination of transcription and translation. 

Preferred vectors preferably contain a promoter that is recognized by the host 
organism. The promoter sequences of the present invention may be prokaryotic, 
eukaryotic or viral. Examples of suitable prokaryotic sequences include the Pr and P L 

30 promoters of bacteriophage lambda (The bacteriophage Lambda, Hershey, A. D., Ed., 
Cold Spring Harbor Press, Cold Spring Harbor, NY (1973), which is incorporated 
herein by reference in its entirety; Lambda II, Hendrix, R. W., Ed., Cold Spring 
Harbor Press, Cold Spring Harbor, NY (1980), which is incorporated herein by 
reference in its entirety); the trp, recA, heat shock, and lacZ promoters of E. coli and 
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the SV40 early proper (Beno.st « ai. N.ure, 290. 304-310. W h,ch is 
incorp0 rated herem by reference tn te ent.rety, Add.t.ona, probers tnc.ud, but 
11,^^-^-^™^^^^°^ ter 

hemoglobm, human muscle creattne, and human metalothioncn. 

Add.tiona, regulatory sciences can also be included ,n preferred vec ors. 
Preferred examples of su.tab.e regu.atory seances are represented y the Shme- 
Dalgamo of the rephcase gene of the phage MS-2 and of the gene ell of 

h .ambda The Shtne-Dalgamo sequence may be d.rect.y followed by 
bacteriophage lambda, lne^nm b „rPPR x orotein 

DNA encoding -GPCR.x „d >>» «P«»°° °< lte " 

Moreover, expression -» - - *" 

„ 1 sc« .f*« »-r»^ •»» « ,ls ^ «»•"»""">" ° f """" 

» 8 ~ »„ te ^ — - ,« - -~ - 

, tf,e art and described in Sambiookeifl/., 

A n ongin of rephcat.on can a,so be prov.ded e.ther by construct,™ of the 

chromosoma! replication mechamsm. If the vector ts tntegrated ,nto the host 

riome, Z latter ^to^ 
,„ whtch conta.n vira, ongms of rephcatton, one sailed m the ^ 
' mammahan cel.. by the method of co-transformat.on w.h a 

TgPCR-x DNA. An examp>e of a su.table marXer i, d.hydrofolate reductase (DHFR) 

Nucleotide sequences encoding GPCR-x may 
2! DNA i„ .ecrd^ce „„n convent «™ »^'< " 

„ed-=«ded « for lieuion. —ion en.vro. d, B C,on ,o prov.d, 

,11, ,0 avoid „nde S ,de raW e fining, end H^ion - „propr»-e ^ 

L,« ro, »- - — J » s '™ b '» k " s „ 
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Host cells 

According to another aspect of the invention, host cells are provided, 
including prokaryotic and eukaryotic cells, comprising a polynucleotide of the 
invention (or vector of the invention) in a manner that permits expression of the 
5 encoded nGPCR-x polypeptide. Polynucleotides of the invention may be introduced 
into the host cell as part of a circular plasmid, or as linear DNA comprising an 
isolated protein coding region or a viral vector. Methods for introducing DNA into 
the host cell that are well known and routinely practiced in the art include 
transformation, transfection, electroporation, nuclear injection, or fusion with carriers 

10 such as liposomes, micelles, ghost cells, and protoplasts. Expression systems of the 
invention include bacterial, yeast, fungal, plant, insect, invertebrate, vertebrate, and 
mammalian cells systems. 

The invention provides host cells that are transformed or transfected (stably or 
transiently) with polynucleotides of the invention or vectors of the invention. As 

15 stated above, such host cells are useful for amplifying the polynucleotides and also for 
expressing the nGPCR-x polypeptide or fragment thereof encoded by the 
polynucleotide. 

Ln still another related embodiment, the invention provides a method for 
producing a nGPCR-x polypeptide (or fragment thereof) comprising the steps of 

20 growing a host cell of the invention in a nutrient medium and isolating the 

polypeptide or variant thereof from the cell or the medium. Because nGPCR-x is a 
seven transmembrane receptor, it will be appreciated that, for some applications, such 
as certain activity assays, the preferable isolation may involve isolation of cell 
membranes containing the polypeptide embedded therein, whereas for other 

25 applications a more complete isolation may be preferable. 

According to some aspects of the present invention, transformed host cells 
having an expression vector comprising any of the nucleic acid molecules described 
above are provided. Expression of the nucleotide sequence occurs when the 
expression vector is introduced into an appropriate host cell. Suitable host cells for 

30 expression of the polypeptides of the invention include, but are not limited to, 

prokaryotes, yeast, and eukaryotes. If a prokaryotic expression vector is employed, 
then the appropriate host cell would be any prokaryotic cell capable of expressing the 
cloned sequences. Suitable prokaryotic cells include, but are not limited to, bacteria 
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of the genera Escherichia, Bacillus, Salmonella, Pseudomonas, Streptomyces, and 

Staphylococcus. 

If an eukaryotic expression vector is employed, then the appropriate host cel. 
would be any eukaryotic cell capable of expressing the cloned sequence. Preferably, 

5 eukaryotic cells are cells of higher eukaryotes. Suiuble eukaryotic cells include, but 
are no, limited to, non-human mammalian tissue culture cells and human tissue 
culture cells. Preferred host cells include, but are not limited to, insect cells, HeLa 
cells Chinese hamster ovary cells (CHO cells), African green monkey kidney cells 
(COS cells) human 293 cells, and munne 3T3 fibroblasts. Propagation of such cells 

10 in cel. culture has become a routine procedure (see, Tissue Culture, Academic Press, 
Kn.se and Patterson, eds. (1973), which is incorporated herein by reference ,n its 
entirety). 

In addition, a yeas, host may be employed as a host cel.. Preferred yeast cells 
include but are not limited to, the genera Saccharoses, P.Ma, and Kluveromyces. 
Preferred yeast hosts are 5. cere.tstae and P. pastor*. Preferred yeast vectors can 
contain an origin of replication sequence from a 2T yeast p.asm.d, an autonomously 
replication sequence (ARS), a promoter region, sequences for polyadenylat.on, 
sequences for transcription termination, and a selectable marker gene. Shuttle vectors 
for replication in both yeast and E. coli are also included herein. 

Alternatively, insect cells may be used as host cells. In a preferred 
embodiment, the polypeptides of the invention are expressed using a baculovirus 
expression system (see, Luckow et al., BiofTecUnology, 1988, 6, 47, Bacu.ov.rus 
Expression Vectors: A Laboratory Manua., O'Riclly e, al. (Eds.), W.H. Freeman and 
Company New York, 1992, and U.S. Patent No. 4,879,236, each of wh.ch ,s 
mcorporated here.n by reference in its ent.rety). In addition, the MAXBAC™ 
complete bacu.ov.rus expression system (Invitrogen) can, for example, be used for 
production in insect cells. 

Host cells of the invent.on are a valuable source of immunogen for 
development of antibodies specifically immunoreact.ve with nGPCR-x. Host cells of 
the invention are also useful in methods for the large-scale production of nGPCR-x 
polypeptides wherein the cel.s are grown in a suitable culture medium and the desired 
polypeptide products are isolated from the ce.ls, or from the medium in which the 
cells are grown, by purification methods known in the art, e.g.. conventional 
chromatographic methods including immunoaffimty chromatography, receptor 
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affinity chromatography, hydrophobic interaction chromatography, lectin affinity 
chromatography, size exclusion filtration, cation or anion exchange chromatography, 
high pressure liquid chromatography (HPLC), reverse phase HPLC, and the like. Still 
other methods of purification include those methods wherein the desired protein is 
5 expressed and purified as a fusion protein having a specific tag, label, or chelating 
moiety that is recognized by a specific binding partner or agent. The purified protein 
can be cleaved to yield the desired protein, or can be left as an intact fusion protein. 
Cleavage of the fusion component may produce a form of the desired protein having 
additional amino acid residues as a result of the cleavage process. 

to Knowledge of nGPCR-x DNA sequences allows for modification of cells to 

permit, or increase, expression of endogenous nGPCR-x. Cells can be modified (e.g., 
by homologous recombination) to provide increased expression by replacing, in 
whole or in part, the naturally occurring nGPCR-x promoter with all or part of a 
heterologous promoter so that the cells express nGPCR-x at higher levels. The 

15 heterologous promoter is inserted in such a manner that it is operatively linked to 
endogenous nGPCR-x encoding sequences. (See, for example, PCT International 
Publication No. WO 94/12650, PCT International Publication No. WO 92/20808, and 
PCT International Publication No. WO 91/09955.) It is also contemplated that, in 
addition to heterologous promoter DNA, amplifiable marker DNA (e.g., ada, dhfr, 

20 and the multifunctional CAD gene which encodes carbamoyl phosphate synthase, 
aspartate transcarbamylase, and dihydroorotase) and/or intron DNA may be inserted 
along with the heterologous promoter DNA. If linked to the nGPCR-x coding 
sequence, amplification of the marker DNA by standard selection methods results in 
co-amplification of the nGPCR-x coding sequences in the cells. 

25 Knock-outs 

The DNA sequence information provided by the present invention also makes 
possible the development (e.g., by homologous recombination or "knock-out" 
strategies; see Capecchi, Science 244:1288-1292 (1989), which is incorporated herein 
by reference) of animals that fail to express functional nGPCR-x or that express a 
30 variant of nGPCR-x. Such animals (especially small laboratory animals such as rats, 
rabbits, and mice) are useful as models for studying the in vivo activities of nGPCR-x 
and modulators of nGPCR-x. 
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Antisense 

Also made available by the invention are ant-sense polynucleotides that 
r ecogn,ze and hybridize to polynucleotide, encoding nGPCR-x. FuH-length and 
fragment ant^sense polynucleotides a, provided. Fragment antisense molecules o 
the tnvent.cn include (i) those that specifically recognize and hybnd.ze to nGPCR. 
RNA (as determined by sequence comparison of DNA encoding A 
eroding other known molecules). Identification of sequences umque to nGPCR-x 
encoding polynucleotide can be deduced through use of any publicly ava.lab.e 
sequence database, and/or through use of commercial* available sequence 
, eompanson programs. After identification of the desired sequences, isolation through 
restriction digestion or amplification using any of the various polymerase cha.n 
reaction techniques well known in the art can be performed. A« 
po.ynucleotides are particularly relevant to regulating expression of nGPCR-x by 
those cells expressing nGPCR-x mRNA. 

Antisense nucleic acds (preferably 10 to 30 base-pair oligonucleotides) 
capable of specifically binding to nGPCR-x expression control sequences or nGPCR- 
x L are introduced into cells (e*. by a v.ra, vector or colloidal dispersion system 
SU ch as a liposome). The antisense nucleic acid binds to the nGPCR-x target 
nuc ,eotide sequence in the eel, and prevents transcription and/or translat.o n c > *e 
20 target sequence. Phosphoroth.oate and methylphosphonate antisense ohgonuc.eot.des 
JspecficaUy contemplated for therapeutic use by the invention. The antisense 
oligonucleotide, may be further modified by adding po,y-L-lysine, transferrin 
po.ylysme, or cholesterol moieties at their 5' end. Suppression of nGPCR-x 
expression at either the transcription, or transitional level ,s useful to generate 
25 cehular or an.mal models for diseases/conditions charactered by aberrant nGPCR-x 
expression. 

Anusense oligonucleotides, or fragments of odd numbered nucleotide 
sequences ranging from SEQ ID NO: 1 to SEQ ID NO: 93 and SEQ ID NO: 185 or 
sequences complementary or homologous thereto, denved from the nucleof e 

i- rDPD v w ireful as diagnostic tools 
,0 sequences of the present invention encoding nGPCR-x are useful as g 

• „ i„ various tissues For example, tissue can be probed in 
for probing gene expression in various tissues, ru 

Sltu w,th oligonucleotide probes carrying detectable groups by conventional 
autoradiography techniques to investigate native expression of this enzyme or 
pathologica, conditions relating thereto. Antisense oligonucleotides are preferably 
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directed to regulatory regions of odd numbered nucleotide sequences ranging from 
SEQ ID NO: 1 to SEQ ID NO: 93 and SEQ ID NO: 185, ormRNA corresponding 
thereto, including, but not limited to, the initiation codon, TATA box, enhancer 
sequences, and the like. 
5 Transcription factors 

The nGPCR-x sequences taught in the present invention facilitate the design 
of novel transcription factors for modulating nGPCR-x expression in native cells and 
animals, and cells transformed or transfected with nGPCR-x polynucleotides. For 
example, the Cys2-His2 zinc finger proteins, which bind DNA via their zinc finger 

10 domains, have been shown to be amenable to structural changes that lead to the 
recognition of different target sequences. These artificial zinc finger proteins 
recognize specific target sites with high affinity and low dissociation constants, and 
are able to act as gene switches to modulate gene expression. Knowledge of the 
particular nGPCR-x target sequence of the present invention facilitates the 

15 engineering of zinc finger proteins specific for the target sequence using known 

methods such as a combination of structure-based modeling and screening of phage 
display libraries (Segal et aL, Proc. Natl. Acad. Sci. (USA) 96:2758-2763 (1999); Liu 
et al, Proc. Natl. Acad. Sci. (USA) 94:5525-5530 (1997); Greisman et aL, Science 
275:657-661 (1997); Choo et aL, J. Mol. Biol. 273:525-532 (1997)). Each zinc finger 

20 domain usually recognizes three or more base pairs. Since a recognition sequence of 
18 base pairs is generally sufficient in length to render it unique in any known 
genome, a zinc finger protein consisting of 6 tandem repeats of zinc fingers would be 
expected to ensure specificity for a particular sequence (Segal et aL) The artificial 
zinc finger repeats, designed based on nGPCR-x sequences, are fused to activation or 

25 repression domains to promote or suppress nGPCR-x expression (Liu et aL) 

Alternatively, the zinc finger domains can be fused to the TATA box-binding factor 
(TBP) with varying lengths of linker region between the zinc finger peptide and the 
TBP to create either transcriptional activators or repressors (Kim et aL, Proc. Natl. 
Acad. Sci. (USA) 94:3616-3620 ( 1997). Such proteins and polynucleotides that 

30 encode them, have utility for modulating nGPCR-x expression in vivo in both native 
cells, animals and humans; and/or cells transfected with nGPCR-x-encoding 
sequences. The novel transcription factor can be delivered to the target cells by 
transfecting constructs that express the transcription factor (gene therapy), or by 
introducing the protein. Engineered zinc finger proteins can also be designed to bind 
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RNA sequences for use in therapeutics as alternatives to anhsense or catalytic RNA 
methods (McCol. et al, Proc. Natl. Acad. Sc, (USA) 96:9521-9526 (1997); Wu et 
al., Proc. Natl. Acad. Sci. (USA) 92:344-348 (1995)). The present invention 
contemplates methods of designing such transcription factors based on the gene 
sequence of the invention, as well as customed zinc finger proteins, that are useful 
to modulate nGPCR-x expression in cells (native or transformed) whose genetic 
complement includes these sequences. 
Polypeptides 

The invention also provides purified and isolated mammalian nGPCR-x 
polypeptides encoded by a polynucleotide of the invention. Presently preferred is a 
human nGPCR-x polypeptide composing the amino acid sequence set out >n even 
numbered sequences ranging from SEQ ID NO: 2 to SEQ ID NO: 94 and SEQ ID 
NO- 186 or fragments thereof comprising an epitope specific to the polypeptide. By 
"epitope specific to" is meant a portion of the nGPCR receptor that is recognizable by 
an antibody that is specific for the nGPCR, as defined in detail below. 

Although the sequences provided are particular human sequences, the 
invention is intended to include with.n its scope other human allelic variants; non- 
human mammalian forms of nGPCR-x, and other vertebrate forms of nGPCR-x. 
It w.11 be appreciated that extracellular epitopes are particularly useful for 
, generating and screening for antibodies and other binding compounds that bind to 
receptors such as nGPCR-x. Thus, in another preferred embodiment, the invention 
provides a punfied and isolated polypeptide comprising at least one extracellular 
domain (e.g., the N-terminal extracellular domain or one of the three extracellular 
loops) of nGPCR-x. Punfied and isolated polypeptides composing the N-term,nal 
5 extracellular domain of nGPCR-x are highly preferred. Also preferred is a purified 
and isolated polypeptide comprising a nGPCR-x fragment selected from the group 
consisting of the N-terminal extracellular domain of nGPCR-x, transmembrane 
domains of nGPCR-x, an extracellular loop connecting transmembrane domains of 
nGPCR-x. an intracellular loop connecting transmembrane domains of nGPCR-x, the 
S0 C-termmal cytoplasmic region of nGPCR-x, and fusions thereof. Such fragments 
may be continuous portions of the native receptor. However, it will also be 
appreciated that knowledge of the nGPCR-x gene and protein sequences as provided 
herein permits recombining of various domains that are not contiguous in the native 
protein. Using a FORTRAN computer program called "tmtrest.all" [Parodi et al, 
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Comput. Appl. Biosci. 5:527-535 (1994)], nGPCR-x was shown to contain 
transmembrane-spanning domains. 

The invention also embraces polypeptides that have at least 99%, at least 95%, 
at least 90%, at least 85%, at least 80%, at least 75%, at least 70%, at least 65%, at 
least 60%, at least 55% or at least 50% identity and/or homology to the preferred 
polypeptide of the invention. Percent amino acid sequence "identity" with respect to 
the preferred polypeptide of the invention is defined herein as the percentage of amino 
acid residues in the candidate sequence that are identical with the residues in the 
nGPCR-x sequence after aligning both sequences and introducing gaps, if necessary, 
to achieve the maximum percent sequence identity, and not considering any 
conservative substitutions as part of the sequence identity. Percent sequence 
"homology" with respect to the preferred polypeptide of the invention is defined 
herein as the percentage of amino acid residues in the candidate sequence that are 
identical with the residues in the nGPCR-x sequence after aligning the sequences and 
introducing gaps, if necessary, to achieve the maximum percent sequence identity, 
and also considering any conservative substitutions as part of the sequence identity. 

In one aspect, percent homology is calculated as the percentage of amino acid 
residues in the smaller of two sequences which align with identical amino acid residue 
in the sequence being compared, when four gaps in a length of 100 amino acids may 
be introduced to maximize alignment [Dayhoff, in Atlas of Protein Sequence and 
Structure , Vol. 5, p. 124, National Biochemical Research Foundation, Washington, 
D.C. (1972), incorporated herein by reference]. 

Polypeptides of the invention may be isolated from natural cell sources or may 
be chemically synthesized, but are preferably produced by recombinant procedures 
involving host cells of the invention. Use of mammalian host cells is expected to 
provide for such post-translational modifications (e.g., glycosylation, truncation, 
Iipidation, and phosphorylation) as may be needed to confer optimal biological 
activity on recombinant expression products of the invention. Glycosylated and non- 
glycosylated forms of nGPCR-x polypeptides are embraced by the invention. 

The invention also embraces variant (or analog) nGPCR-x polypeptides. In 
one example, insertion variants are provided wherein one or more amino acid residues 
supplement a nGPCR-x amino acid sequence. Insertions may be located at either or 
both termini of the protein, or may be positioned within internal regions of the 
nGPCR-x amino acid sequence. Insertional variants with additional residues at either 
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i j fnrpvamnle fusion proteins and proteins including amino 
or both termini can include, for example, iusion p. 

acid tags or labels. 

Insertion vanants include nGPCR-x polypeptides wherein one or more ammo 
acid residues are added to a nGPCR-x acid sequence or to a biologically active 

fragment thereof. 

Vanant products of the invention also include mature nGPCR-x products, ,e., 
nGPCR-x products wherein leader or signal sequences are removed, with additional 
am.no terminal residues. The additional amino terminal residues may be derived 
from another prote.n, or may include one or more residues that are not identifiable as 
be,ng derived from specific proteins. nGPCR-x products with an additional 
methionine residue at position -1 (Me, '-nGPCR-x) are contemplated, as are variants 
with additional methionine and lysine residues at positions -2 and -1 
(Met-.Lys'-nGPCR-x). Vanants of nGPCR-x w„h additional Met, Met-Lys, Lys 
residues (or one or more basic residues in general) are particularly useful for 
enhanced recombinant protein production in bacterial host cells. 

The invention also embraces nGPCR-x vanants having additional am.no acd 
residues that result from use of specific expression systems. For example, use of 
commercially avai.able vectors that express a desired polypeptide as part of a 
glu ,a,h,one-S-transferase (GST) fus.on product provides the desired polypeptide 
having an additional glycine residue at position -1 after cleavage of the GST 
component from the des.red polypeptide. Variants that result from expression ,n 
other vector systems are also contemplated. 

msertional vanants also include fusion proteins wherein the amino term.nus 
and/or the carboxy term.nus of nGPCR-x is/are fused to another polypeptide. 

in another aspect, the invention provides deletion vanants where.n one or 
m0 re amino acd residues in a nGPCR-x polypeptide are removed. Deletions can be 
effected at one or both termini of the nGPCR-x polypeptide, or w.th removal of one or 
more non-term.na. amino acid res.dues of nGPCR-x. Deletion vanants, therefore, 
include all fragments of a nGPCR-x polypept.de. 

The invention also embraces polypeptide fragments of the even numbered 
sequences ranging from SEQ ID NO: 2 ,o SEQ ID NO: 94 and SEQ ID NO: .86 
where,n the fragments ma.nta.n biological (, g „ l.gand binding and/or intracellular 
signaling) immunological properties of a nGPCR-x polypeptide. 



31 





W O 01/36473 



PCT/US00/31581 



In one preferred embodiment of the invention, an isolated nucleic acid 
molecule comprises a nucleotide sequence that encodes a polypeptide comprising an 
amino acid sequence homologous to even numbered sequences selected from the 
group consisting of: SEQ TD NO:2 to SEQ ID NO:94, SEQ ID NO: 186, and 
5 fragments thereof, wherein the nucleic acid molecule encoding at least a portion of 
nGPCR-x. In a more preferred embodiment, the isolated nucleic acid molecule 
comprises a sequence that encodes a polypeptide comprising even numbered 
sequences selected from the group consisting of SEQ ID NO:2 to SEQ ID NO: 94, 
SEQ ID NO: 1 86, and fragments thereof. 

10 As used in the present invention, polypeptide fragments comprise at least 5, 

10, 15, 20, 25, 30, 35, or 40 consecutive amino acids of the even numbered sequences 
ranging from SEQ ID NO: 2 to SEQ ID NO: 94 and SEQ ID NO: 186. Preferred 
polypeptide fragments display antigenic properties unique to, or specific for, human 
nGPCR-x and its allelic and species homologs. Fragments of the invention having the 

15 desired biological and immunological properties can be prepared by any of the 
methods well known and routinely practiced in the art. 

In still another aspect, the invention provides substitution variants of nGPCR- 
x polypeptides. Substitution variants include those polypeptides wherein one or more 
amino acid residues of a nGPCR-x polypeptide are removed and replaced with 

20 alternative residues. In one aspect, the substitutions are conservative in nature; 
however, the invention embraces substitutions that are also non-conservative. 
Conservative substitutions for this purpose may be defined as set out in Tables 2, 3, or 
4 below. 



25 been introduced by modification of polynucleotides encoding polypeptides of the 
invention. Amino acids can be classified according to physical properties and 
contribution to secondary and tertiary protein structure. A conservative substitution is 
recognized in the art as a substitution of one amino acid for another amino acid that 
has similar properties. Exemplary conservative substitutions are set out in Table 2 



30 (from WO 97/09433, page 10, published March 13, 1997 (PCT/GB96/02 1 97, filed 
9/6/96), immediately below. 



Variant polypeptides include those wherein conservative substitutions have 
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Table 2 
Conservative Substitutions I 

SIDE CHAIN 
CHARACTERISTIC AMINO ACID 

Aliphatic 

Non-polar GAP 

I LV 

Polar - uncharged C S T M 

NQ 

Polar - charged D E 

K. R 

Aromatic H F W Y 

Other N Q D E 

Alternatively, conservative amino acids can be grouped as described in 
5 Lehninger, [ Biochemistry , Second Edition; Worth Publishers, Inc. NY, NY (1975), 
pp.7 1-77] as set out in Table 3, below. 

Table 3 
Conservative Substitutions II 

10 SIDE CHAIN 

CHARACTERISTIC AMINO ACID 

Non-polar (hydrophobic) 

A. Aliphatic: ALIVP 

B. Aromatic: F W 

C. Sulfur-containing: M 

D. Borderline: G 
Uncharged-polar 

A. Hydroxyl: .STY 

B. Amides: NQ 

C. Sulfhydryl: C 

D. Borderline: G 
Positively Charged (Basic): K R H 
Negatively Charged (Acidic): D E 

As still another alternative, exemplary conservative substitutions are set out in 
Table 4, below. 
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Table 4 

Conservative Substitutions III 

Original Residue Exemplary Substitution 

Ala (A) Val, Leu, He 

Arg (R) Lys, Gin, Asn 

Asn(N) Gin, His, Lys, Arg 

Asp (D) Glu 

Cys (C) Ser 

Gin (Q) Asn 

Glu(E) Asp 

His(H) Asn, Gin, Lys, Arg 

He (I) Leu, Val, Met, Ala, Phe, 
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Leu (L) 


He, Val, Met, Ala, Phe 


Lys (K) 


Arg, Gin, Asn 


Met (M) 


Leu, Phe. He 


Phe (F) 


Leu, Val, He, Ala 


Pro (P) 


Gly 


Ser (S) 


Thr 


Thr(T) 


Ser 


Trp(W) 


Tyr 


Tyr (Y) 


Trp, Phe, Thr, Ser 


Val (V) 


He, Leu, Met, Phe, Ala 



It should be understood that the definition of polypeptides of the invention is 
intended to include polypeptides bearing modifications other than insertion, deletion, 
or substitution of amino acid residues. By way of example, the modifications may be 
5 covalent in nature, and include for example, chemical bonding with polymers, lipids, 
other organic, and inorganic moieties. Such derivatives may be prepared to increase 
circulating half-life of a polypeptide, or may be designed to improve the targeting 
capacity of the polypeptide for desired cells, tissues, or organs. Similarly, the 
invention further embraces nGPCR-x polypeptides that have been covalently 

10 modified to include one or more water-soluble polymer attachments such as 

polyethylene glycol, polyoxyethylene glycol, or polypropylene glycol. Variants that 
display ligand binding properties of native nGPCR-x and are expressed at higher 
levels, as well as variants that provide for constitutively active receptors, are 
particularly useful in assays of the invention; the variants are also useful in providing 

15 cellular, tissue and animal models of diseases/conditions characterized by aberrant 
nGPCR-x activity. 

In a related embodiment, the present invention provides compositions 
comprising purified polypeptides of the invention. Preferred compositions comprise, 
in addition to the polypeptide of the invention, a pharmaceutically acceptable (i.e., 

20 sterile and non-toxic) liquid, semisolid, or solid diluent that serves as a 

pharmaceutical vehicle, excipient, or medium. Any diluent known in the art may be 
used. Exemplary diluents include, but are not limited to, water, saline solutions, 
polyoxyethylene sorbitan monolaurate, magnesium stearate, methyl- and 
propylhydroxybenzoate, talc, alginates, starches, lactose, sucrose, dextrose, sorbitol, 

25 mannitol, glycerol, calcium phosphate, mineral oil, and cocoa butter. 

Variants that display ligand binding properties of native nGPCR-x and are 
expressed at higher levels, as well as variants that provide for constitutively active 
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receptors, are particularly useful in assays of the invention; the variants are also useful 
in assays of the invention and in providing cellular, tissue and animal models of 
diseases/conditions characterized by aberrant nGPCR-x activity. 

The G protein-coupled receptor functions through a specific heterotrimeric 
guanine-nucleotide-binding regulatory protein (G-protein) coupled to the intracellular 
portion of the G protein-coupled receptor molecule. Accordingly, the G protein- 
coupled receptor has a specific affinity to G protein. G proteins specifically bind to 
guanine nucleotides. Isolation of G proteins provides a means to isolate guanine 
nucleotides. G Proteins may be isolated using commercially available anti-G protein 
antibodies or isolated G protein-coupled receptors. Similarly, G proteins may be 
detected in a sample isolated using commercially available detectable anti-G protein 
antibodies or isolated G protein-coupled receptors. 

According to the present invention, the isolated n-GPCR-x proteins of the 
present invention are useful to isolate and punfy G proteins from samples such as cell 
lysates. Example 15 below sets forth an example of isolation of G proteins using 
isolated n-GPCR-x proteins. Such methodolgy may be used in place of the use of 
commercially available anti-G protein antibodies which are used to isolate G proteins. 
Moreover, G proteins may be detected using n-GPCR-x proteins in place of 
commercially available detectable anti-G protein antibodies. Since n-GPCR-x 
proteins specifically bind to G proteins, they can be employed in any specific use 
where G protein specific affinity is required such as those uses where commercially 
available anti-G protein antibodies are employed. 
Antibodies 

Also comprehended by the present invention are antibodies (e.g. , monoclonal 
and polyclonal antibodies, single chain antibodies, chimeric antibodies, 
biflinctional/bispecific antibodies, humanized antibodies, human antibodies, and 
complementary determining region (CDR)-graHed antibodies, including compounds 
which include CDR sequences which specifically recognize a polypeptide of the 
invention) specific for nGPCR-x or fragments thereof Preferred antibodies of the 
invention are human antibodies that are produced and identified according to methods 
described in W093/1 1236, published June 20, 1993, which is incorporated herein by 
reference in its entirety. Antibody fragments, including Fab, Fab', F(ab') 2 > and F v , are 
also provided by the invention. The term "specific for," when used to describe 
antibodies of the invention, indicates that the variable regions of the antibodies of the 
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invention recognize and bind nGPCR-x polypeptides exclusively (i.e., are able to 
distinguish nGPCR-x polypeptides from other known GPCR polypeptides by virtue of 
measurable differences in binding affinity, despite the possible existence of localized 
sequence identity, homology, or similarity between nGPCR-x and such polypeptides). 
5 It will be understood that specific antibodies may also interact with other proteins (for 
example, S. aureus protein A or other antibodies in ELISA techniques) through 
interactions with sequences outside the variable region of the antibodies, and, in 
particular, in the constant region of the molecule. Screening assays to determine 
binding specificity of an antibody of the invention are well known and routinely 

10 practiced in the art. For a comprehensive discussion of such assays, see Harlow et at. 
(Eds.), Antibodies A Laboratory Manual ; Cold Spring Harbor Laboratory; Cold 
Spring Harbor, NY (1 988), Chapter 6. Antibodies that recognize and bind fragments 
of the nGPCR-x polypeptides of the invention are also contemplated, provided that 
the antibodies are specific for nGPCR-x polypeptides. Antibodies of the invention 

15 can be produced using any method well known and routinely practiced in the art. 

The invention provides an antibody that is specific for the nGPCR-x of the 
invention. Antibody specificity is described in greater detail below. However, it 
should be emphasized that antibodies that can be generated from polypeptides that 
have previously been described in the literature and that are capable of fortuitously 

20 cross-reacting with nGPCR-x {e.g., due to the fortuitous existence of a similar epitope 
in both polypeptides) are considered "cross-reactive" antibodies. Such cross-reactive 
antibodies are not antibodies that are "specific" for nGPCR-x. The determination of 
whether an antibody is specific for nGPCR-x or is cross-reactive with another known 
receptor is made using any of several assays, such as Western blotting assays, that are 

25 well known in the art. For identifying cells that express nGPCR-x and also for 

modulating nGPCR-x-ligand binding activity, antibodies that specifically bind to an 
extracellular epitope of the nGPCR-x are preferred. 

in one preferred variation, the invention provides monoclonal antibodies. 
Hybridomas that produce such antibodies also are intended as aspects of the 

30 invention. In yet another variation, the invention provides a humanized antibody. 
Humanized antibodies are useful for in vivo therapeutic indications. 

In another variation, the invention provides a cell-free composition comprising 
polyclonal antibodies, wherein at least one of the antibodies is an antibody of the 
invention specific for nGPCR-x. Antisera isolated from an animal is an exemplary 
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composition, as is a composition comprising an antibody fraction of an antisera that 
has been resuspended in water or in another diluent, excipient, or earner. 

In still another related embodiment, the invention provides an anti-idiotypic 
antibody specific for an antibody that is specific for nGPCR-x. 
5 It is well known that antibodies contain relatively small antigen binding 

domains that can be isolated chemically or by recombinant techniques. Such domains 
are useful nGPCR-x binding molecules themselves, and also may be reintroduced into 
human antibodies, or fused to toxins or other polypeptides. Thus, in still another 
embodiment, the invention provides a polypeptide comprising a fragment of a 
10 nGPCR-x-specific antibody, wherein the fragment and the polypeptide bind to the 

nGPCR-x. By way of non-limiting example, the invention provides polypeptides that 
are single chain antibodies and CDR-grafted antibodies. 

Non-human antibodies may be humanized by any of the methods known in the 
art. In one method, the non-human CDRs are inserted into a human antibody or 
15 consensus antibody framework sequence. Further changes can then be introduced 
into the antibody framework to modulate affinity or immunogenicity. 

Antibodies of the invention are useful for, e.g., therapeutic purposes (by 
modulating activity of nGPCR-x), diagnostic purposes to detect or quantitate nGPCR- 
x, and purification of nGPCR-x. Kits comprising an antibody of the invention for any 
20 of the purposes described herein are also comprehended. In general, a kit of the 

invention also includes a control antigen for which the antibody is immunospecific. 
Compositions 

Mutations in the nGPCR-x gene that result in loss of normal function of the 
nGPCR-x gene product underlie nGPCR-x-related human disease states. The 

25 invention comprehends gene therapy to restore nGPCR-x activity to treat those 

disease states. Delivery of a functional nGPCR-x gene to appropriate cells is effected 
ex vivo, in situ, or in vivo by use of vectors, and more particularly viral vectors (e.g., 
adenovirus, adeno-associated virus, or a retrovirus), or ex vivo by use of physical 
DNA transfer methods {e.g., liposomes or chemical treatments). See, for example, 

30 Anderson, Nature, supplement to vol. 392, no. 6679, pp.25-20 (1998). For additional 
reviews of gene therapy technology see Fnedmznn, Science, 244: 1275-1281 (1989); 
Verma, Scientific American: 68-84 (1990); and Miller, Nature, 357: 455-460 (1992). 
Alternatively, it is contemplated that in other human disease states, preventing the 
expression of, or inhibiting the activity of, nGPCR-x will be useful in treating disease 
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states. It is contemplated that antisense therapy or gene therapy could be applied to 
negatively regulate the expression of nGPCR-x. 

Another aspect of the present invention is directed to compositions, including 
pharmaceutical compositions, comprising any of the nucleic acid molecules or 
5 recombinant expression vectors described above and an acceptable carrier or diluent. 
Preferably, the carrier or diluent is pharmaceutically acceptable. Suitable carriers are 
described in the most recent edition of Remington's Pharmaceutical Sciences, A. 
Osol, a standard reference text in this field, which is incorporated herein by reference 
in its entirety. Preferred examples of such carriers or diluents include, but are not 

10 limited to, water, saline, Ringer's solution, dextrose solution, and 5% human serum 
albumin. Liposomes and nonaqueous vehicles such as fixed oils may also be used. 
The formulations are sterilized by commonly used techniques. 

Also within the scope of the invention are compositions comprising 
polypeptides, polynucleotides, or antibodies of the invention that have been 

15 formulated with, e.g., a pharmaceutically acceptable carrier. 

The invention also provides methods of using antibodies of the invention. For 
example, the invention provides a method for modulating ligand binding of a nGPCR- 
x comprising the step of contacting the nGPCR-x with an antibody specific for the 
nGPCR-x, under conditions wherein the antibody binds the receptor. 

20 GPCRs that may be expressed in the brain, such as nGPCR-x, provide an 

indication that aberrant nGPCR-x signaling activity may correlate with one or more 
neurological or psychological disorders. The invention also provides a method for 
treating a neurological or psychiatric disorder comprising the step of administering to 
a mammal in need of such treatment an amount of an antibody-like polypeptide of the 

25 invention that is sufficient to modulate ligand binding to a nGPCR-x in neurons of the 
mammal. nGPCR-x may also be expressed in other tissues, including but not limited 
to, peripheral blood lymphocytes, pancreas, ovary, uterus, testis, salivary gland, 
thyroid gland, kidney, adrenal gland, liver, bone marrow, prostate, fetal liver, colon, 
muscle, and fetal brain, and may be found in many other tissues. Within the brain, 

30 nGPCR-x mRNA transcripts may be found in many tissues, including, but not limited 
to, frontal lobe, hypothalamus, pons, cerebellum, caudate nucleus, and medulla. 
Tissues and brain regions where specific nGPCRs of the present invention are 
expressed are identified in the Examples below. 
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Kits 

The present invention is also directed to kits, including pharmaceutical kits. 
The kits can comprise any of the nucleic acid molecules described above, any of the 
polypeptides described above, or any antibody which binds to a polypeptide of the 
5 invention as described above, as well as a negative control. The kit preferably 

comprises additional components, such as, for example, instructions, solid support, 
reagents helpful for quantification, and the like. 

In another aspect, the invention features methods for detection of a 
polypeptide in a sample as a diagnostic tool for diseases or disorders, wherein the 
10 method comprises the steps of: (a) contacting the sample with a nucleic acid probe 
which hybridizes under hybridization assay conditions to a nucleic acid target region 
of a polypeptide having the sequence of even numbered sequences ranging from SEQ 
ID NO: 2 to SEQ ID NO: 94 and SEQ ID NO: 1 86, said probe comprising the nucleic 
acid sequence encoding the polypeptide, fragments thereof, and the complements of 
15 the sequences and fragments; and (b) detecting the presence or amount of the 
probe:target region hybrid as an indication of the disease. 

In preferred embodiments of the invention, the disease is selected from the 
group consisting of thyroid disorders (e.g. thyreotoxicosis, myxoedema); renal failure; 
inflammatory conditions (e.g., Crohn's disease); diseases related to cell differentiation 
20 and homeostasis; rheumatoid arthritis; autoimmune disorders; movement disorders; 
CNS disorders (e.g., pain including migraine; stroke; psychotic and neurological 
disorders, including anxiety, schizophrenia, manic depression, anxiety, generalized 
anxiety disorder, post-traumatic-stress disorder, depression, bipolar disorder, delirium, 
dementia, severe mental retardation; dyskinesias, such as Huntington's disease or 
25 Tourette's Syndrome; attention disorders including ADD and ADHD, and 

degenerative disorders such as Parkinson's, Alzheimer's; movement disorders, 
including ataxias, supranuclear palsy, etc.); infections, such as viral infections caused 
by H1V-1 or HIV-2; metabolic and cardiovascular diseases and disorders {e.g., type 2 
diabetes, obesity, anorexia, hypotension, hypertension, thrombosis, myocardial 
30 infarction, cardiomyopathies, atherosclerosis, etc.); proliferative diseases and cancers 
(e.g., different cancers such as breast, colon, lung, etc., and hyperproliferative 
disorders such as psoriasis, prostate hyperplasia, etc.); hormonal disorders (e.g., 
male/female hormonal replacement, polycystic ovarian syndrome, alopecia, etc.); and 
sexual dysfunction, among others. 
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As described above and in Example 4 below, the genes encoding nGPCR-1 
(nucleic acid sequence SEQ ID NO: 1, SEQ ID NO: 73, amino acid sequence SEQ ID 
NO: 2, SEQ ID NO:74), nGPCR-9 (nucleic acid sequence SEQ ID NO:9, SEQ ID 
NO:77, amino acid sequence SEQ ID NO: 1 0, SEQ ED NO:78), nGPCR-1 1 (nucleic 
5 acid sequence SEQ ID NO: 1 1 , SEQ ID NO:79, amino acid sequence SEQ ID NO: 1 2, 
SEQ ID NO:80), nGPCR-16 (nucleic acid sequence SEQ ID NO: 21, SEQ ID NO:81, 
amino acid sequence SEQ ID NO: 22, SEQ ED NO:82), nGPCR-40 (nucleic acid 
sequence SEQ ID NO:53, SEQ ID NO:83, amino acid sequence SEQ ID NO:54, SEQ 
ID NO:84), nGPCR-54 (nucleic acid sequence SEQ ID NO:59, SEQ ID NO:85, 

10 amino acid sequence SEQ ID NO:60, SEQ ID NO: 86), nGPCR-56 (nucleic acid 

sequence SEQ ID NO:63, SEQ ID NO:87, SEQ ID NO:89, amino acid sequence SEQ 
ID NO:64, SEQ ID NO: 88, SEQ ID NO:90), nGPCR-58 (nucleic acid sequence SEQ 
ID NO:67, SEQ ID NO 91, SEQ ID NO:93, amino acid sequence SEQ ID NO:68, 
SEQ ED NO: 92, SEQ ID NO:94) and nGPCR-3 (nucleic acid sequence SEQ ID 

15 NO:3, SEQ ID NO:185, amino acid sequence SEQ ID NO:4, SEQ ED NO: 186) have 
been detected in brain tissue indicating that these n-GPCR-x proteins are 
neuroreceptors. Kits may be designed to detect either expression of polynucleotides 
encoding these proteins or the proteins themselves in order to identify tissue as being 
neurological. For example, oligonucleotide hybridization kits can be provided which 

20 include a container having an oligonucleotide probe specific for the n-GPCR-x- 

specific DNA and optionally, containers with positive and negative controls and/or 
instructions. Similarly, PCR kits can be provided which include a container having 
primers specific for the n-GPCR-x-specific sequences, DNA and optionally, 
containers with size markers, positive and negative controls and/or instructions. 

25 Hybridization conditions should be such that hybridization occurs only with 

the genes in the presence of other nucleic acid molecules. Under stringent 
hybridization conditions only highly complementary nucleic acid sequences 
hybridize. Preferably, such conditions prevent hybridization of nucleic acids having 1 
or 2 mismatches out of 20 contiguous nucleotides. Such conditions are defined supra. 

30 The diseases for which detection of genes in a sample could be diagnostic 

include diseases in which nucleic acid (DNA and/or RNA) is amplified in comparison 
to normal cells. By "amplification" is meant increased numbers of DNA or RNA in a 
cell compared with normal cells. 
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The diseases that could be diagnosed by detection of nucleic acid in a sample 
preferably include central nervous system and metabolic diseases. The test samples 
suitable for nucleic acid probing methods of the present invention include, for 
example, cells or nucleic acid extracts of cells, or biological fluids. The samples used 
5 in the above-described methods will vary based on the assay format, the detection 
method and the nature of the tissues, cells or extracts to be assayed. Methods for 
preparing nucleic acid extracts of cells are well known in the art and can be readily 
adapted in order to obtain a sample that is compatible with the method utilized. 
Alternatively, immunoassay kits can be provided which have containers 
10 container having antibodies specific for the n-GPCR-x-protein and optionally, 
containers with positive and negative controls and/or instructions. 

Kits may also be provided useful in the identification of GPCR binding 
partners such as natural ligands or modulators (agonists or antagonists). Substances 
useful for treatment of disorders or diseases preferably show positive results in one or 
15 more in vitro assays for an activity corresponding to treatment of the disease or 
disorder in question. Substances that modulate the activity of the polypeptides 
preferably include, but are not limited to, antisense oligonucleotides, agonists and 
antagonists, and inhibitors of protein kinases. 
Methods of inducing immune response 
20 Another aspect of the present invention is directed to methods of inducing an 

immune response in a mammal against a polypeptide of the invention by 
administering to the mammal an amount of the polypeptide sufficient to induce an 
immune response. The amount will be dependent on the animal species, size of the 
animal, and the like but can be determined by those skilled in the art. 
25 Methods of identifying ligands 

The invention also provides assays to identify compounds that bind nGPCR-x. 
One such assay comprises the steps of: (a) contacting a composition comprising a 
nGPCR-x with a compound suspected of binding nGPCR-x; and (b) measuring 
binding between the compound and nGPCR-x. In one variation, the composition 
30 comprises a cell expressing nGPCR-x on its surface. In another variation, isolated 
nGPCR-x or cell membranes comprising nGPCR-x are employed. The binding may 
be measured directly, e.g., by using a labeled compound, or may be measured 
indirectly by several techniques, including measuring intracellular signaling of 
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nGPCR-x induced by the compound (or measuring changes in the level of nGPCR-x 
signaling). 

Specific binding molecules, including natural ligands and synthetic 
compounds, can be identified or developed using isolated or recombinant nGPCR-x 

5 products, nGPCR-x variants, or preferably, cells expressing such products. Binding 
partners are useful for purifying nGPCR-x products and detection or quantification of 
nGPCR-x products in fluid and tissue samples using known immunological 
procedures. Binding molecules are also manifestly useful in modulating (i.e., 
blocking, inhibiting or stimulating) biological activities of nGPCR-x, especially those 

10 activities involved in signal transduction. 

The DNA and amino acid sequence information provided by the present 
invention also makes possible identification of binding partner compounds with which 
a nGPCR-x polypeptide or polynucleotide will interact. Methods to identify binding 
partner compounds include solution assays, in vitro assays wherein nGPCR-x 

15 polypeptides are immobilized, and cell-based assays. Identification of binding partner 
compounds of nGPCR-x polypeptides provides candidates for therapeutic or 
prophylactic intervention in pathologies associated with nGPCR-x normal and 
aberrant biological activity. 

The invention includes several assay systems for identifying nGPCR-x 

20 binding partners. In solution assays, methods of the invention comprise the steps of 
(a) contacting a nGPCR-x polypeptide with one or more candidate binding partner 
compounds and (b) identifying the compounds that bind to the nGPCR-x polypeptide. 
Identification of the compounds that bind the nGPCR-x polypeptide can be achieved 
by isolating the nGPCR-x polypeptide/binding partner complex, and separating the 

25 binding partner compound from the nGPCR-x polypeptide. An additional step of 
characterizing the physical, biological, and/or biochemical properties of the binding 
partner compound is also comprehended in another embodiment of the invention. In 
one aspect, the nGPCR-x polypeptide/binding partner complex is isolated using an 
antibody immunospecific for either the nGPCR-x polypeptide or the candidate 

30 binding partner compound. 

In still other embodiments, either the nGPCR-x polypeptide or the candidate 
binding partner compound comprises a label or tag that facilitates its isolation, and 
methods of the invention to identify binding partner compounds include a step of 
isolating the nGPCR-x polypeptide/binding partner complex through interaction with 
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the label or tag. An exemplary tag of this type is a poly-histidine sequence, generally 
around six histidine residues, that permits isolation of a compound so labeled using 
nickel chelation. Other labels and tags, such as the FLAG® tag (Eastman Kodak, 
Rochester, NY), well known and routinely used in the art, are embraced by the 
5 invention. 

In one variation of an in vitro assay, the invention provides a method 
comprising the steps of (a) contacting an immobilized nGPCR-x polypeptide with a 
candidate binding partner compound and (b) detecting binding of the candidate 
compound to the nGPCR-x polypeptide. In an alternative embodiment, the candidate 
10 binding partner compound is immobilized and binding of nGPCR-x is detected. 
Immobilization is accomplished using any of the methods well known in the art, 
including covalent bonding to a support, a bead, or a chromatographic resin, as well 
as non-covalent, high affinity interactions such as antibody binding, or use of 
streptavidin/biotin binding wherein the immobilized compound includes a biotin 
15 moiety. Detection of binding can be accomplished (i) using a radioactive label on the 
compound that is not immobilized, (ii) using of a fluorescent label on the non- 
immobilized compound, (iii) using an antibody immunospecific for the non- 
immobilized compound, (iv) using a label on the non-immobilized compound that 
excites a fluorescent support to which the immobilized compound is attached, as well 
20 as other techniques well known and routinely practiced in the art. 

The invention also provides cell-based assays to identify binding partner 
compounds of a nGPCR-x polypeptide. In one embodiment, the invention provides a 
method comprising the steps of contacting a nGPCR-x polypeptide expressed on the 
surface of a cell with a candidate binding partner compound and detecting binding of 
25 the candidate binding partner compound to the nGPCR-x polypeptide. In a preferred 
embodiment, the detection comprises detecting a calcium flux or other physiological 
event in the cell caused by the binding of the molecule. 

Another aspect of the present invention is directed to methods of identifying 
compounds that bind to either nGPCR-x or nucleic acid molecules encoding nGPCR- 
30 x, comprising contacting nGPCR-x, or a nucleic acid molecule encoding the same, 
with a compound, and determining whether the compound binds nGPCR-x or a 
nucleic acid molecule encoding the same. Binding can be determined by binding 
assays which are well known to the skilled artisan, including, but not limited to, gel- 
shift assays, Western blots, radiolabeled competition assay, phage-based expression 
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cloning, co-fractionation by chromatography, co-precipitation, cross linking, 
interaction trap/two-hybrid analysis, southwestern analysis, ELISA, and the like, 
which axe described in, for example, Current Protocols in Molecular Biology\ 1999, 
John Wiley & Sons, NY, which is incorporated herein by reference in its entirety. 
5 The compounds to be screened include (which may include compounds which are 
suspected to bind nGPCR-x, or a nucleic acid molecule encoding the same), but are 
not limited to, extracellular, intracellular, biologic or chemical origin. The methods of 
the invention also embrace ligands, especially neuropeptides, that are attached to a 
label, such as a radiolabel (e.g., 125 I, 35 S, 32 P, 33 P, 3 H), a fluorescence label, a 

10 chemiluminescent label, an enzymic label and an immunogenic label. Modulators 
falling within the scope of the invention include, but are not limited to, non-peptide 
molecules such as non-peptide mimetics, non-peptide allosteric effectors, and 
peptides. The nGPCR-x polypeptide or polynucleotide employed in such a test may 
either be free in solution, attached to a solid support, borne on a cell surface or located 

15 intracellularly or associated with a portion of a cell. One skilled in the art can, for 
example, measure the formation of complexes between nGPCR-x and the compound 
being tested. Alternatively, one skilled in the art can examine the diminution in 
complex formation between nGPCR-x and its substrate caused by the compound 
being tested. 

20 In another embodiment of the invention, high throughput screening for 

compounds having suitable binding affinity to nGPCR-x is employed. Briefly, large 
numbers of different small peptide test compounds are synthesized on a solid 
substrate. The peptide test compounds are contacted with nGPCR-x and washed. 
Bound nGPCR-x is then detected by methods well known in the art. Purified 

25 polypeptides of the invention can also be coated directly onto plates for use in the 
aforementioned drug screening techniques. In addition, non-neutralizing antibodies 
can be used to capture the protein and immobilize it on the solid support. 

Generally, an expressed nGPCR-x can be used for HTS binding assays in 
conjunction with its defined ligand, in this case the corresponding neuropeptide that 

30 activates it. The identified peptide is labeled with a suitable radioisotope, including, 
but not limited to, !25 1, 3 H, 35 S or 32 P, by methods that are well known to those skilled 
in the art. Alternatively, the peptides may be labeled by well-known methods with a 
suitable fluorescent derivative (Baindur et aL, Drug Dev. Res., 1994, 33, 373-398; 
Rogers, Drug Discovery Today, 1997, 2, 156-160). Radioactive ligand specifically 
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bound to the receptor in membrane preparations made from the cell line expressing 
the recombinant protein can be detected in HTS assays in one of several standard 
ways, including filtration of the receptor-ligand complex to separate bound hgand 
from unbound ligand (Williams, Med. Res. Rev., 1991, / 7, 147-184; Sweetnam et ah, 
5 y. Natural Products, 1993, 56, 441-455). Alternative methods include a scintillation 
proximity assay (SPA) or a FlashPlate format in which such separation is unnecessary 
(Nakayama, Cur. Opinion Drug Disc. Dev., 1998, 7, 85-91 Bosse et al, J. 
Biomolecular Screening, 1998, 3, 285-292.)- Binding of fluorescent ligands can be 
detected in various ways, including fluorescence energy transfer (FRET), direct 
1 o spectrophotofluorometric analysis of bound ligand, or fluorescence polarization 
(Rogers, Drug Discovery Today, 1997 2, 156-160; Hill, Cur. Opinion Drug Disc. 
Dev., 1998, I, 92-97). 

Other assays may be used to identify specific ligands of a nGPCR-x receptor, 
including assays that identify ligands of the target protein through measuring direct 
15 binding of test ligands to the target protein, as well as assays that identify ligands of 
target proteins through affinity ultrafiltration with ion spray mass spectroscopy/HPLC 
methods or other physical and analytical methods. Alternatively, such binding 
interactions are evaluated indirectly using the yeast two-hybrid system described in 
Fields et al., Nature, 340:245-246 (1989), and Fields et al, Trends in Genetics, 
20 1 0:286-292 (1994), both of which are incorporated herein by reference. The two- 
hybrid system is a genetic assay for detecting interactions between two proteins or 
polypeptides. It can be used to identify proteins that bind to a known protein of 
interest, or to delineate domains or residues critical for an interaction. Variations on 
this methodology have been developed to clone genes that encode DNA binding 
25 proteins, to identify peptides that bind to a protein, and to screen for drugs. The two- 
hybrid system exploits the ability of a pair of interacting proteins to bring a 
transection activation domain into close proximity with a DNA binding domain that 
binds to an upstream activation sequence (UAS) of a reporter gene, and is generally 
performed in yeast. The assay requires the construction of two hybrid genes encoding 
30 (1) a DNA-binding domain that is fused to a first protein and (2) an activation domain 
fused to a second protein. The DNA-binding domain targets the first hybrid protein to 
the UAS of the reporter gene; however, because most proteins lack an activation 
domain, this DNA-binding hybrid protein does not activate transcription of the 
reporter gene. The second hybrid protein, which contains the activation domain, 
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cannot by itself activate expression of the reporter gene because it does not bind the 
UAS. However, when both hybrid proteins are present, the noncovalent interaction of 
the first and second proteins tethers the activation domain to the UAS, activating 
transcription of the reporter gene. For example, when the first protein is a GPCR 
5 gene product, or fragment thereof, that is known to interact with another protein or 
nucleic acid, this assay can be used to detect agents that interfere with the binding 
interaction. Expression of the reporter gene is monitored as different test agents are 
added to the system. The presence of an inhibitory agent results in lack of a reporter 
signal. 

10 The function of nGPCR-x gene products is unclear and no ligands have yet 

been found which bind the gene product. The yeast two-hybrid assay can also be used 
to identify proteins that bind to the gene product. In an assay to identify proteins that 
bind to a nGPCR-x receptor, or fragment thereof, a fusion polynucleotide encoding 
both a nGPCR-x receptor (or fragment) and a UAS binding domain (i.e., a first 

15 protein) may be used. In addition, a large number of hybrid genes each encoding a 
different second protein fused to an activation domain are produced and screened in 
the assay. Typically, the second protein is encoded by one or more members of a total 
cDNA or genomic DNA fusion library, with each second protein-coding region being 
fused to the activation domain. This system is applicable to a wide variety of 

20 proteins, and it is not even necessary to know the identity or function of the second 
binding protein. The system is highly sensitive and can detect interactions not 
revealed by other methods; even transient interactions may trigger transcription to 
produce a stable mRNA that can be repeatedly translated to yield the reporter protein. 
Other assays may be used to search for agents that bind to the target protein. 

25 One such screening method to identify direct binding of test ligands to a target protein 
is described in U.S. Patent No. 5,585,277, incorporated herein by reference. This 
method relies on the principle that proteins generally exist as a mixture of folded and 
unfolded states, and continually alternate between the two states. When a test ligand 
binds to the folded form of a target protein (i.e., when the test ligand is a ligand of the 

30 target protein), the target protein molecule bound by the ligand remains in its folded 
state. Thus, the folded target protein is present to a greater extent in the presence of a 
test ligand which binds the target protein, than in the absence of a ligand. Binding of 
the ligand to the target protein can be determined by any method that distinguishes 
between the folded and unfolded states of the target protein. The function of the 
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target protein need not be known in order for this assay to be performed. Virtually 
any agent can be assessed by this method as a test ligand, including, but not limited 
to, metals, polypeptides, proteins, lipids, polysaccharides, polynucleotides and small 
organic molecules. 

5 Another method for identifying ligands of a target protein is described in 

Wieboldt et al, Anal. Chem., 69:1683-1691 (1997), incorporated herein by reference. 
This technique screens combinatorial libraries of 20-30 agents at a time in solution 
phase for binding to the target protein. Agents that bind to the target protein are 
separated from other library components by simple membrane washing. The 

10 specifically selected molecules that are retained on the filter are subsequently 

liberated from the target protein and analyzed by HPLC and pneumatically assisted 
electrospray (ion spray) ionization mass spectroscopy. This procedure selects library 
components with the greatest affinity for the target protein, and is particularly useful 
for small molecule libraries. 

15 Other embodiments of the invention comprise using competitive screening 

assays in which neutralizing antibodies capable of binding a polypeptide of the 
invention specifically compete with a test compound for binding to the polypeptide. 
In this manner, the antibodies can be used to detect the presence of any peptide that 
shares one or more antigenic determinants with nGPCR-x. Radiolabeled competitive 

20 binding studies are described in A.H. Lin et al. Antimicrobial Agents and 

Chemotherapy, 1997, vol. 41, no. 10. pp. 2127-2131, the disclosure of which is 
incorporated herein by reference in its entirety. 

As described above and in Example 4 below, the genes encoding nGPCR-1 
(nucleic acid sequence SEQ ID NO: 1, SEQ ID NO: 73, amino acid sequence SEQ ID 

25 NO: 2, SEQ ID NO:74), nGPCR-9 (nucleic acid sequence SEQ ID NO:9, SEQ ID 
NO:77, amino acid sequence SEQ ID NO:10, SEQ ID NO:78), nGPCR-1 1 (nucleic 
acid sequence SEQ ID NO:l 1, SEQ ID NO:79, amino acid sequence SEQ ID NO: 12, 
SEQ ID NO:80), nGPCR-16 (nucleic acid sequence SEQ ID NO: 21, SEQ ID NO:81, 
amino acid sequence SEQ ID NO: 22, SEQ ID NO:82), nGPCR-40 (nucleic acid 

30 sequence SEQ ID NO:53, SEQ ID NO:83, amino acid sequence SEQ ID NO:54, SEQ 
ID NO:84), nGPCR-54 (nucleic acid sequence SEQ ID NO:59, SEQ ID NO:85, 
amino acid sequence SEQ ID NO:60, SEQ ID NO: 86), nGPCR-56 (nucleic acid 
sequence SEQ ID NO:63, SEQ ID NO:87, SEQ ID NO:89, amino acid sequence SEQ 
ID NO:64, SEQ ID NO: 88, SEQ ID NO:90), nGPCR-58 (nucleic acid sequence SEQ 
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ID NO:67, SEQ ID NO:91, SEQ ID NO:93, amino acid sequence SEQ ED NO:68, 
SEQ ID NO: 92, SEQ ID NO:94), and nGPCR-3 (nucleic acid sequence SEQ ID 
NO:3, SEQ ID NO: 185, amino acid sequence SEQ ID NO:4, SEQ ID NO: 186) have 
been detected in brain tissue indicating that these n-GPCR-x proteins are 
neuroreceptors. Accordingly, natural binding partners of these molecules include 
neurotransmitters. 

Identification of modulating agents 

The invention also provides methods for identifying a modulator of binding 
between a nGPCR-x and a nGPCR-x binding partner, comprising the steps of: (a) 
contacting a nGPCR-x binding partner and a composition comprising a nGPCR-x in 
the presence and in the absence of a putative modulator compound; (b) detecting 
binding between the binding partner and the nGPCR-x; and (c) identifying a putative 
modulator compound or a modulator compound in view of decreased or increased 
binding between the binding partner and the nGPCR-x in the presence of the putative 
modulator, as compared to binding in the absence of the putative modulator. 

nGPCR-x binding partners that stimulate nGPCR-x activity are useful as 
agonists in disease states or conditions characterized by insufficient nGPCR-x 
signaling (e.g., as a result of insufficient activity of a nGPCR-x ligand). nGPCR-x 
binding partners that block ligand-mediated nGPCR-x signaling are useful as nGPCR- 
x antagonists to treat disease states or conditions characterized by excessive nGPCR-x 
signaling. In addition nGPCR-x modulators in general, as well as nGPCR-x 
polynucleotides and polypeptides, are useful in diagnostic assays for such diseases or 
conditions. 

In another aspect, the invention provides methods for treating a disease or 
abnormal condition by administering to a patient in need of such treatment a 
substance that modulates the activity or expression of a polypeptide having the 
sequence of even numbered sequences ranging from SEQ ID NO: 2 to SEQ ID NO: 
94 and SEQ ID NO: 186. 

Agents that modulate (i.e., increase, decrease, or block) nGPCR-x activity or 
expression may be identified by incubating a putative modulator with a cell 
containing a nGPCR-x polypeptide or polynucleotide and determining the effect of 
the putative modulator on nGPCR-x activity or expression. The selectivity of a 
compound that modulates the activity of nGPCR-x can be evaluated by comparing its 
effects on nGPCR-x to its effect on other GPCR compounds. Selective modulators 
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may include, for example, antibodies and other proteins, peptides, or organic 
molecules that specifically bind to a nGPCR-x polypeptide or a nGPCR-x-encoding 
nucleic acid. Modulators of nGPCR-x activity will be therapeutically useful in 
treatment of diseases and physiological conditions in which normal or aberrant 
nGPCR-x activity is involved. nGPCR-x polynucleotides, polypeptides, and 
modulators may be used in the treatment of such diseases and conditions as 
infections, such as viral infections caused by HIV-1 or HIV-2; pain; cancers; 
Parkinson's disease; hypotension; hypertension; and psychotic and neurological 
disorders, including anxiety, schizophrenia, manic depression, delirium, dementia, 
severe mental retardation and dyskinesias, such as Huntington's disease or Tourette's 
Syndrome, among others. nGPCR-x polynucleotides and polypeptides, as well as 
nGPCR-x modulators, may also be used in diagnostic assays for such diseases or 
conditions. 

Methods of the invention to identify modulators include variations on any of 
the methods described above to identify binding partner compounds, the variations 
including techniques wherein a binding partner compound has been identified and the 
binding assay is carried out in the presence and absence of a candidate modulator. A 
modulator is identified in those instances where binding between the nGPCR-x 
polypeptide and the binding partner compound changes in the presence of the 
candidate modulator compared to binding in the absence of the candidate modulator 
compound. A modulator that increases binding between the nGPCR-x polypeptide 
and the binding partner compound is described as an enhancer or activator, and a 
modulator that decreases binding between the nGPCR-x polypeptide and the binding 
partner compound is described as an inhibitor. 

The invention also comprehends high-throughput screening (HTS) assays to 
identify compounds that interact with or inhibit biological activity (i.e., affect 
enzymatic activity, binding activity, etc.) of a nGPCR-x polypeptide. HTS assays 
permit screening of large numbers of compounds in an efficient manner. Cell-based 
HTS systems are contemplated to investigate nGPCR-x receptor-ligand interaction. 
HTS assays are designed to identify "hits" or "lead compounds" having the desired 
property, from which modifications can be designed to improve the desired property. 
Chemical modification of the "hit" or "lead compound" is often based on an 
identifiable structure/activity relationship between the "hit" and the nGPCR-x 
polypeptide. 
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Another aspect of the present invention is directed to methods of identifying 
compounds which modulate (i.e., increase or decrease) activity of nGPCR-x 
comprising contacting nGPCR-x with a compound, and determining whether the 
compound modifies activity of nGPCR-x. The activity in the presence of the test 
5 compared is measured to the activity in the absence of the test compound. Where the 
activity of the sample containing the test compound is higher than the activity in the 
sample lacking the test compound, the compound will have increased activity. 
Similarly, where the activity of the sample containing the test compound is lower than 
the activity in the sample lacking the test compound, the compound will have 

10 inhibited activity. 

The present invention is particularly useful for screening compounds by using 
nGPCR-x in any of a variety of drug screening techniques. The compounds to be 
screened include (which may include compounds which are suspected to modulate 
nGPCR-x activity), but are not limited to, extracellular, intracellular, biologic or 

15 chemical origin. The nGPCR-x polypeptide employed in such a test may be in any 
form, preferably, free in solution, attached to a solid support, borne on a cell surface 
or located intracellularly. One skilled in the art can, for example, measure the 
formation of complexes between nGPCR-x and the compound being tested. 
Alternatively, one skilled in the art can examine the diminution in complex formation 

20 between nGPCR-x and its substrate caused by the compound being tested. 

The activity of nGPCR-x polypeptides of the invention can be determined by, 
for example, examining the ability to bind or be activated by chemically synthesized 
peptide ligands. Alternatively, the activity of nGPCR-x polypeptides can be assayed 
by examining their ability to bind calcium ions, hormones, chemokines, 

25 neuropeptides, neurotransmitters, nucleotides, lipids, odorants, and photons. 
Alternatively, the activity of the nGPCR-x polypeptides can be determined by 
examining the activity of effector molecules including, but not limited to, adenylate 
cyclase, phospholipases and ion channels. Thus, modulators of nGPCR-x polypeptide 
activity may alter a GPCR receptor function, such as a binding property of a receptor 

30 or an activity such as G protein-mediated signal transduction or membrane 

localization. In various embodiments of the method, the assay may take the form of 
an ion flux assay, a yeast growth assay, a non-hydrolyzable GTP assay such as a 
[ 35 S]-GTP S assay, a cAMP assay, an inositol triphosphate assay, a diacylglycerol 
assay, an Aequorin assay, a Luciferase assay, a FLIPR assay for intracellular Ca 2 + 
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concentration, a mitogenesis assay, a MAP Kinase activity assay, an arachidonic acid 
release assay (e.g., using [ 3 H]-arachidonic acid), and an assay for extracellular 
acidification rates, as well as other binding or function-based assays of nGPCR-x 
activity that are generally known in the art. In several of these embodiments, the 
5 invention comprehends the inclusion of any of the G proteins known in the art, such 
as G , 6 , G I5 , or chimeric G qd5 , G qs5 , G qo5 , G q25 , and the like. nGPCR-x activity can be 
determined by methodologies that are used to assay for FaRP activity, which is well 
known to those skilled in the art. Biological activities of nGPCR-x receptors 
according to the invention include, but are not limited to, the binding of a natural or 
10 an unnatural ligand, as well as any one of the functional activities of GPCRs known in 
the art. Non-limiting examples of GPCR activities include transmembrane signaling 
of various forms, which may involve G protein association and/or the exertion of an 
influence over G protein binding of various guamdylate nucleotides; another 
exemplary activity of GPCRs is the binding of accessory proteins or polypeptides that 
1 5 di ffer from known G proteins. 

The modulators of the invention exhibit a variety of chemical structures, 
which can be generally grouped into non-peptide mimetics of natural GPCR receptor 
ligands, peptide and non-peptide allosteric effectors of GPCR receptors, and peptides 
that may function as activators or inhibitors (competitive, uncompetitive and non- 
20 competitive) (e.g. , antibody products) of GPCR receptors. The invention does not 
restrict the sources for suitable modulators, which may be obtained from natural 
sources such as plant, animal or mineral extracts, or non-natural sources such as small 
molecule libraries, including the products of combinatorial chemical approaches to 
library construction, and peptide libraries. Examples of peptide modulators of GPCR 
25 receptors exhibit the following primary structures: GLGPRPLRF amide, 

GNSFLRF amide, GGPQGPLRFamide, GPSGPLRFamide, PDVDHVFLRF amide, 
and pyro-EDVDHVFLRFamide. 

Other assays can be used to examine enzymatic activity including, but not 
limited to, photometric, radiometric, HPLC, electrochemical, and the like, which are 
30 described in, for example, Enzyme Assays: A Practical Approach, eds. R. Eisenthal 
and M. J. Danson, 1992, Oxford University Press, which is incorporated herein by 
reference in its entirety. 

The use of cDNAs encoding GPCRs in drug discovery programs is well- 
known; assays capable of testing thousands of unknown compounds per day in high- 
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throughput screens (HTSs) are thoroughly documented. The literature is replete with 
examples of the use of radiolabeled Hgands in HTS binding assays for drug discovery 
(see Williams, Medicinal Research Reviews, 1991, 11, 147-184.; Sweetnam, et al, J. 
Natural Products, 1993, 56, 441-455 for review). Recombinant receptors are 
5 preferred for binding assay HTS because they allow for better specificity (higher 

relative purity), provide the ability to generate large amounts of receptor material, and 
can be used in a broad variety of formats (see Hodgson, Bio/Technology, 1992, 10, 
973-980; each of which is incorporated herein by reference in its entirety). 

A variety of heterologous systems is available for functional expression of 

10 recombinant receptors that are well known to those skilled in the art. Such systems 
include bacteria (Strosberg, et al, Trends in Pharmacological Sciences, 1992, 13, 95- 
98), yeast (Pausch, Trends in Biotechnology, 1997, 15, 487-494), several kinds of 
insect cells (Vanden Broeck, Int. Rev. Cytology, 1996, 164, 189-268), amphibian cells 
(Jayawickreme et al, Current Opinion in Biotechnology, 1997, 8, 629-634) and 

15 several mammalian cell lines (CHO, HEK293, COS, etc.; see Gerhardt, et al, Eur. J. 
Pharmacology, 1997, 334, 1-23). These examples do not preclude the use of other 
possible cell expression systems, including cell lines obtained from nematodes (PCT 
application WO 98/37177). 

In preferred embodiments of the invention, methods of screening for 

20 compounds that modulate nGPCR-x activity comprise contacting test compounds 
with nGPCR-x and assaying for the presence of a complex between the compound 
and nGPCR-x. In such assays, the ligand is typically labeled. After suitable 
incubation, free ligand is separated from that present in bound form, and the amount 
of free or uncomplexed label is a measure of the ability of the particular compound to 

25 bind to nGPCR-x. 

It is well known that activation of heterologous receptors expressed in 
recombinant systems results in a variety of biological responses, which are mediated 
by G proteins expressed in the host cells. Occupation of a GPCR by an agonist results 
in exchange of bound GDP for GTP at a binding site on the G a subunit; one can use a 

30 radioactive, non-hydrolyzable derivative of GTP, GTPt( 3:, S], to measure binding of 
an agonist to the receptor (Sim et al, Neuroreport, 1996, 7, 729-733). One can also 
use this binding to measure the ability of antagonists to bind to the receptor by 
decreasing binding of GTPt[ 35 S] in the presence of a known agonist. One could 
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therefore construct a HTS based on GTPt( 35 S] binding, though this is not the 
preferred method. 

The G proteins required for functional expression of heterologous GPCRs can 
be native constituents of the host cell or can be introduced through well-known 
5 recombinant technology. The G proteins can be intact or chimeric. Often, a nearly 
universally competent G protein (e.g., G a i 6 ) is used to couple any given receptor to a 
detectable response pathway. G protein activation results in the stimulation or 
inhibition of other native proteins, events that can be linked to a measurable response. 
Examples of such biological responses include, but are not limited to, the 

10 following: the ability to survive in the absence of a limiting nutrient in specifically 

engineered yeast cells (Pausch, Trends in Biotechnology, 1997, 15, 487-494); changes 
in intracellular Ca 2+ concentration as measured by fluorescent dyes (Murphy, et al, 
Cur. Opinion Drug Disc. Dev., 1998, 1 , 192-199). Fluorescence changes can also be 
used to monitor ligand-induced changes in membrane potential or intracellular pH; an 

15 automated system suitable for HTS has been described for these purposes (Schroeder, 
et al., J. Biomolecular Screening, 1996, 1, 75-80). Melanophores prepared from 
Xenopus laevis show a ligand-dependent change in pigment organization in response 
to heterologous GPCR activation; this response is adaptable to HTS formats 
(Jayawickreme et al., Cur. Opinion Biotechnology, 1997, 8, 629-634). Assays are 

20 also available for the measurement of common second messengers, including cAMP, 
phosphoinositides and arachidonic acid, but these are not generally preferred for HTS. 

Preferred methods of HTS employing these receptors include permanently 
transfected CHO cells, in which agonists and antagonists can be identified by the 
ability to specifically alter the binding of GTPy[ 3:) S] in membranes prepared from 

25 these cells. In another embodiment of the invention, permanently transfected CHO 
cells could be used for the preparation of membranes which contain significant 
amounts of the recombinant receptor proteins; these membrane preparations would 
then be used in receptor binding assays, employing the radiolabeled ligand specific 
for the particular receptor. Alternatively, a functional assay, such as fluorescent 

30 monitoring of ligand-induced changes in internal Ca 2+ concentration or membrane 
potential in permanently transfected CHO cells containing each of these receptors 
individually or in combination would be preferred for HTS. Equally preferred would 
be an alternative type of mammalian cell, such as HEK293 or COS cells, in similar 
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formats. More preferred would be permanently transfected insect cell lines, such as 
Drosophila S2 cells. Even more preferred would be recombinant yeast cells 
expressing the Drosophila melanogaster receptors in HTS formats well known to 
those skilled in the art {e.g., Pausch, Trends in Biotechnology, 1997, 75, 487-494). 



inhibitors of ligand binding to nGPCR-x receptors. In one example, the nGPCR-x 
receptor is immobilized and interaction with a binding partner is assessed in the 
presence and absence of a candidate modulator such as an inhibitor compound. In 
another example, interaction between the nGPCR-x receptor and its binding partner is 
10 assessed in a solution assay, both in the presence and absence of a candidate inhibitor 
compound. In either assay, an inhibitor is identified as a compound that decreases 
binding between the nGPCR-x receptor and its binding partner. Another 
contemplated assay involves a variation of the dihybrid assay wherein an inhibitor of 
protein/protein interactions is identified by detection of a positive signal in a 
15 transformed or transfected host cell, as described in PCT publication number WO 
95/20652, published August 3, 1995. 

Candidate modulators contemplated by the invention include compounds 
selected from libraries of either potential activators or potential inhibitors. There are a 
number of different libraries used for the identification of small molecule modulators, 
20 including: (1) chemical libraries, (2) natural product libraries, and (3) combinatorial 
libraries comprised of random peptides, oligonucleotides or organic molecules. 
Chemical libraries consist of random chemical structures, some of which are analogs 
of known compounds or analogs of compounds that have been identified as "hits" or 
'Meads" in other drug discovery screens, some of which are derived from natural 
25 products, and some of which arise from non-directed synthetic organic chemistry. 

Natural product libraries are collections of microorganisms, animals, plants, or marine 
organisms which are used to create mixtures for screening by: (1 ) fermentation and 
extraction of broths from soil, plant or marine microorganisms or (2) extraction of 
plants or marine organisms. Natural product libraries include polyketides, non- 
30 ribosomal peptides, and variants (non-natural ly occurring) thereof. For a review, see 
Science 282:63-68 (1998). Combinatorial libraries are composed of large numbers of 
peptides, oligonucleotides, or organic compounds as a mixture. These libraries are 
relatively easy to prepare by traditional automated synthesis methods, PCR, cloning,, 
or proprietary synthetic methods. Of particular interest are non-peptide combinatorial 
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libraries. Still other libraries of interest include peptide, protein, peptidomimetic, 
multiparallel synthetic collection, recombinatorial, and polypeptide libraries. For a 
review of combinatorial chemistry and libraries created therefrom, see Myers, Curr. 
Opin. Biotechnol. 8:701-707 (1997). Identification of modulators through use of the 
various libraries described herein permits modification of the candidate "hit" (or 
"lead") to optimize the capacity of the "hit" to modulate activity. 

Still other candidate inhibitors contemplated by the invention can be designed 
and include soluble forms of binding partners, as well as such binding partners as 
chimeric, or fusion, proteins. A "binding partner" as used herein broadly 
encompasses non-peptide modulators, as well as such peptide modulators as 
neuropeptides other than natural ligands, antibodies, antibody fragments, and 
modified compounds comprising antibody domains that are immunospecific for the 
expression product of the identified nGPCR-x gene. 

The polypeptides of the invention are employed as a research tool for 
identification, characterization and purification of interacting, regulatory proteins. 
Appropriate labels are incorporated into the polypeptides of the invention by various 
methods known in the art and the polypeptides are used to capture interacting 
molecules. For example, molecules are incubated with the labeled polypeptides, 
washed to remove unbound polypeptides, and the polypeptide complex is quantified. 
Data obtained using different concentrations of polypeptide are used to calculate 
values for the number, affinity, and association of polypeptide with the protein 
complex. 

Labeled polypeptides are also useful as reagents for the purification of 
molecules with which the polypeptide interacts including, but not limited to, 
inhibitors. In one embodiment of affinity purification, a polypeptide is covalently 
coupled to a chromatography column. Cells and their membranes are extracted, and 
various cellular subcomponents are passed over the column. Molecules bind to the 
column by virtue of their affinity to the polypeptide. The polypeptide-complex is 
recovered from the column, dissociated and the recovered molecule is subjected to 
protein sequencing. This amino acid sequence is then used to identify the captured 
molecule or to design degenerate oligonucleotides for cloning the corresponding gene 
from an appropriate cDNA library. 

Alternatively, compounds may be identified which exhibit similar properties 
to the ligand for the nGPCR-x of the invention, but which are smaller and exhibit a 
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longer half time than the endogenous ligand in a human or animal body. When an 
organic compound is designed, a molecule according to the invention is used as a 
'Mead" compound. The design of mimetics to known pharmaceutical^ active 
compounds is a well-known approach in the development of pharmaceuticals based 
on such "lead" compounds. Mimetic design, synthesis and testing are generally used 
to avoid randomly screening a large number of molecules for a target property. 
Furthermore, structural data deriving from the analysis of the deduced amino acid 
sequences encoded by the DNAs of the present invention are useful to design new 
drugs, more specific and therefore with a higher pharmacological potency. 

Comparison of the protein sequence of the present invention with the 
sequences present in all the available databases showed a significant homology with 
the transmembrane portion of G protein coupled receptors. Accordingly, computer 
modeling can be used to develop a putative tertiary structure of the proteins of the 
invention based on the available information of the transmembrane domain of other 
proteins. Thus, novel ligands based on the predicted structure of nGPCR-x can be 
designed. 

In a particular embodiment, the novel molecules identified by the screening 
methods according to the invention are low molecular weight organic molecules, in 
which case a composition or pharmaceutical composition can be prepared thereof for 
oral intake, such as in tablets. The compositions, or pharmaceutical compositions, 
comprising the nucleic acid molecules, vectors, polypeptides, antibodies and 
compounds identified by the screening methods described herein, can be prepared for 
any route of administration including, but not limited to, oral, intravenous, cutaneous, 
subcutaneous, nasal, intramuscular or intraperitoneal. The nature of the carrier or 
other ingredients wall depend on the specific route of administration and particular 
embodiment of the invention to be administered. Examples of techniques and 
protocols that are useful in this context are, inter alia, found in Remington's 
Pharmaceutical Sciences, 16 th edition, Osol, A (ed.), 1980, which is incorporated 
herein by reference in its entirety. 

The dosage of these low molecular weight compounds will depend on the 
disease state or condition to be treated and other clinical factors such as weight and 
condition of the human or animal and the route of administration of the compound. 
For treating human or animals, between approximately 0.5 mg/kg of body weight to 
500 mg/kg of body weight of the compound can be administered. Therapy is 
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typically administered at lower dosages and is continued until the desired therapeutic 
outcome is observed. 

The present compounds and methods, including nucleic acid molecules, 
polypeptides, antibodies, compounds identified by the screening methods described 
5 herein, have a variety of pharmaceutical applications and may be used, for example, 
to treat or prevent unregulated cellular growth, such as cancer cell and tumor growth. 
In a particular embodiment, the present molecules are used in gene therapy. For a 
review of gene therapy procedures, see e.g. Anderson, Science, 1992, 256, 808-813, 
which is incorporated herein by reference in its entirety. 
10 The present invention also encompasses a method of agonizing (stimulating) 

or antagonizing a nGPCR-x natural binding partner associated activity in a mammal 
comprising administering to said mammal an agonist or antagonist to one of the above 
disclosed polypeptides in an amount sufficient to effect said agonism or antagonism. 
One embodiment of the present invention, then, is a method of treating diseases in a 
15 mammal with an agonist or antagonist of the protein of the present invention 
comprises administering the agonist or antagonist to a mammal in an amount 
sufficient to agonize or antagonize nGPCR-x-associated functions. 

In an effort to discover novel treatments for diseases, biomedical researchers 
and chemists have designed, synthesized, and tested molecules that inhibit the 
20 function of protein polypeptides. Some small organic molecules form a class of 
compounds that modulate the function of protein polypeptides. Examples of 
molecules that have been reported to inhibit the function of protein kinases include, 
but are not limited to, bis monocyclic, bicyclic or heterocyclic aryl compounds (PCT 
WO 92/20642, published November 26, 1992 by Maguire et a/.), vinylene-azaindole 
25 derivatives (PCT WO 94/14808, published July 7, 1994 by Ballinari et a/.), 1- 

cyclopropyl-4-pyridyl-quinolones (U.S. Patent No. 5,330,992), styryl compounds 
(U.S. Patent No. 5,217,999), styryl-substituted pyridyl compounds (U.S. Patent No. 
5,302,606), certain quinazoline derivatives (EP Application No. 0 566 266 At), 
seleoindoles and selenides (PCT WO 94/03427, published February 17, 1994 by 
30 Denny et ai\ tricyclic polyhydroxylic compounds (PCT WO 92/21660, published 
December 10, 1992 by Dow), and benzylphosphonic acid compounds (PCT WO 
91/15495, published October 17, 1991 by Dow et al), all of which are incorporated by 
reference herein, including any drawings. 
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Exemplary diseases and conditions amenable to treatment based on the present 
invention include, but are not limited to, thyroid disorders (e.g. thyreotoxicosis, 
myxoedema); renal failure; inflammatory conditions (e.g., Chron's disease), diseases 
related to cell differentiation and homeostasis; rheumatoid arthritis; autoimmune 
5 disorders; movement disorders; CNS disorders (e.g., pain including migraine; stroke; 
psychotic and neurological disorders, including anxiety, schizophrenia, manic 
depression, anxiety, generalized anxiety disorder, post-traumatic-stress disorder, 
depression, bipolar disorder, delirium, dementia, severe mental retardation; 
dyskinesias, such as Huntington's disease or Tourette's Syndrome; attention disorders 

10 including ADD and ADHD, and degenerative disorders such as Parkinson's, 
Alzheimer's; movement disorders, including ataxias, supranuclear palsy, etc.); 
infections, such as viral infections caused by HIV-1 or HIV-2; metabolic and 
cardiovascular diseases and disorders (e.g., type 2 diabetes, obesity, anorexia, 
hypotension, hypertension, thrombosis, myocardial infarction, cardiomyopathies, 

15 atherosclerosis, etc.); proliferative diseases and cancers (e.g., different cancers such as 
breast, colon, lung, etc., and hyperproliferative disorders such as psoriasis, prostate 
hyperplasia, etc.); hormonal disorders (e.g., male/female hormonal replacement, 
polycystic ovarian syndrome, alopecia, etc.); sexual dysfunction, among others. 
Compounds that can traverse cell membranes and are resistant to acid 

20 hydrolysis are potentially advantageous as therapeutics as they can become highly 
bioavailable after being administered orally to patients. However, many of these 
protein inhibitors only weakly inhibit function. In addition, many inhibit a variety of 
protein kinases and will therefore cause multiple side effects as therapeutics for 
diseases. 

25 Some indolinone compounds, however, form classes of acid resistant and 

membrane permeable organic molecules. WO 96/22976 (published August 1, 1996 
by Ballinari et al.) describes hydrosoluble indolinone compounds that harbor tetralin, 
naphthalene, quinoline, and indole substituents fused to the oxindole ring. These 
bicyclic substituents are in turn substituted with polar groups including hydroxylated 

30 alkyl, phosphate, and ether substituents. U.S. Patent Application Serial Nos. 

08/702,232, filed August 23, 1996, entitled "Indolinone Combinatorial Libraries and 
Related Products and Methods for the Treatment of Disease" by Tang et al. (Lyon & 
Lyon Docket No. 221/187) and 08/485,323, filed June 7, 1995, entitled "Benzylidene- 
Z-Indoline Compounds for the Treatment of Disease" by Tang et al. (Lyon & Lyon 
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Docket No. 223/298) and International Patent Publication WO 96/22976, published 
August 1, 1996 by Ballinari et al, all of which are incorporated herein by reference in 
their entirety, including any drawings, describe indolinone chemical libraries of 
indolinone compounds harboring other bicychc moieties as well as monocyclic 
5 moieties fused to the oxindole ring. Applications 08/702,232, filed August 23, 1996, 
entitled "Indolinone Combinatorial Libraries and Related Products and Methods for 
the Treatment of Disease" by Tang et al (Lyon & Lyon Docket No. 221/187), 
08/485,323, filed June 7, 1995, entitled "Benzylidene-Z-Indoline Compounds for the 
Treatment of Disease" by Tang et al (Lyon & Lyon Docket No. 223/298), and WO 
10 96/22976, published August 1, 1996 by Ballinari et al teach methods of indolinone 
synthesis, methods of testing the biological activity of indolinone compounds in cells, 
and inhibition patterns of indolinone derivatives, both of which are incorporated by 
reference herein, including any drawings. 

Other examples of substances capable of modulating kinase activity include, 
15 but are not limited to, tyrphostins, quinazolines, quinoxolines, and quinolines. The 

quinazolines, tyrphostins, quinolmes, and quinoxolines referred to above include well- 
known compounds such as those descnbed in the literature. For example, 
representative publications describing quinazolines include Barker et al, EPO 
Publication No. 0 520 722 Al; Jones et al., U.S. Patent No. 4,447,608; Kabbe et al., 
20 U.S. Patent No. 4,757,072; Kaul and Vougioukas, U.S. Patent No. 5, 316,553; 

Kreighbaum and Comer, U.S. Patent No. 4,343,940; Pegg and Wardleworth, EPO 
Publication No. 0 562 734 Al; Barker et al., Proc. of Am. Assoc. for Cancer Research 
32.327 (1991); Bertino, J.R., Cancer Research 3:293-304 ( 1979); Bertmo, J.R., 
Cancer Research 9(2 part l):293-304 (1979); Curtin et al., Br. J. Cancer 53:361-368 
25 (1986); Fernandes et al., Cancer Research 43: 1 1 1 7-1 123 (1983); Ferris et al J. Org. 
Chem. 44(2): 173-1 78; Fry et al, Science 265:1093-1095 (1994); Jackman et al, 
Cancer Research 51 :5579-5586 (1981); Jones et al J. Med. Chem. 29<6):1 1 14-1 U 8; 
Lee and Skibo, Biochemistry 26(23):7355-7362 (1987); Lemus et al, J. Org. Chem. 
54:351 1-3518 (1989); Ley and Seng, Synthesis 1975:415-522 (1975); Maxwells al, 
30 Magnetic Resonance in Medicine 17:189-196 (1991); Mini et al, Cancer Research 
45:325-330 (19S5), Phillips and Castle, J, Heterocyclic Chem. 1 7(19) 1489-1596 
(1980); Reece et al, Cancer Research 47( 1 1 ):2996-2999 (1977); Scuher et al, Cancer 
Immunol, and Immunother. 23:A65 (1986); Sikora et al, Cancer Letters 23:289-295 
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(1984); and Sikora e/ Analytical Biochem. 172:344-355 (1988), all of which are 
incorporated herein by reference in their entirety, including any drawings. 

Quinoxaline is described in Kaul and Vougioukas, U.S. Patent No. 5,316,553, 
incorporated herein by reference in its entirety, including any drawings. 
5 Quinolines are described in Dolle et aL, J. Med. Chem. 37:2627-2629 (1994); 

MaGuire, J. Med. Chem. 37:2129-2131 (1994); Burke et aL, J. Med. Chem. 36:425- 
432 (1993); and Burke et al. BioOrganic Med. Chem. Letters 2:1771-1774 (1992), all 
of which are incorporated by reference in their entirety, including any drawings. 

Tyrphostins are described in Allen et aL, Clin. Exp. Immunol. 91 : 14 1-1 56 
10 (1993); Anafi et al., Blood 82:12:3524-3529 (1993); Baker et aL, J. Cell Sci. 102:543- 
555 (1992); Bilder et aL, Amer. Physiol. Soc. pp. 6363-6 143:C721-C730 (1991); 
Brunton et aL, Proceedings of Amer. Assoc. Cancer Rsch. 33:558 (1992); Bryckaert 
et aL, Experimental Cell Research 199:255-261 (1992); Dong et aL, J. Leukocyte 
Biology 53:53-60 (1993); Dong et aL, J. Immunol. 1 5 1(5):271 7-2724 (1993); Gazit et 

15 aL, J. Med. Chem. 32:2344-2352 (1989); Gazit et aL, J. Med. Chem. 36:3556-3564 
(1993); Kaur et aL, Anti-Cancer Drugs 5:213-222 (1994); King et aL, Biochem. J. 
275:413-418 (1991); Kuo et al., Cancer Letters 74:197-202 (1993); Levitzki, A., The 
FASEB J. 6:3275-3282 (1992); Lyall et aL, J. Biol. Chem. 264:14503-14509 (1989); 
Peterson et aL, The Prostate 22:335-345 (1 993); Pillemer et aL, Int. J. Cancer 50:80- 

20 85 (1992); Posner aL, Molecular Pharmacology 45:673-683 (1993); Rendu et aL, 
Biol. Pharmacology 44(5 ):881-88S (1992); Sauro and Thomas, Life Sciences 53:371- 
376 (1993); Sauro and Thomas, J. Pharm. and Experimental Therapeutics 267(3): 1 19- 
1 125 (1993); Wolbring et aL, J. BioL Chem. 269(36):22470-22472 (1994); and 
Yoneda et aL, Cancer Research 51:4430-4435 (1991); all of which are incorporated 

25 herein by reference in their entirety, including any drawings. 

Other compounds that could be used as modulators include oxindolinones 
such as those described in U.S. patent application Serial No. 08/702,232 filed August 
23, 1996, incorporated herein by reference in its entirety, including any drawings. 
Methods of determining the dosages of compounds to be administered to a 

30 patient and modes of administering compounds to an organism are disclosed in U.S. 
Application Serial No. 08/702,282, filed August 23, 1996 and International patent 
publication number WO 96/22976, published August 1 1996, both of which are 
incorporated herein by reference in their entirety, including any drawings, figures or 
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tables. Those skilled in the art will appreciate that such descriptions are applicable to 
the present invention and can be easily adapted to it. 

The proper dosage depends on various factors such as the type of disease 
being treated, the particular composition being used and the size and physiological 
5 condition of the patient. Therapeutically effective doses for the compounds descnbed 
herein can be estimated initially from cell culture and animal models. For example, a 
dose can be formulated in animal models to achieve a circulating concentration range 
that initially takes into account the IC 50 as determined in cell culture assays. The 
animal model data can be used to more accurately determine useful doses in humans. 
10 Plasma half-life and biodistribution of the drug and metabolites in the plasma, 

tumors and major organs can also be determined to facilitate the selection of drugs 
most appropriate to inhibit a disorder. Such measurements can be carried out. For 
example, HPLC analysis can be performed on the plasma of animals treated with the 
drug and the location of radiolabeled compounds can be determined using detection 
1 5 methods such as X-ray, CAT scan and MRI. Compounds that show potent inhibitory 
activity in the screening assays, but have poor pharm-acokinetic characteristics, can 
be optimized by altering the chemical structure and retesting. In this regard, 
compounds displaying good pharmacokinetic characteristics can be used as a model. 
Toxicity studies can also be carried out by measuring the blood cell 
20 composition. For example, toxicity studies can be carried out in a suitable animal 
model as follows: 1) the compound is administered to mice (an untreated control 
mouse should also be used); 2) blood samples are periodically obtained via the tail 
vein from one mouse in each treatment group; and 3) the samples are analyzed for red 
and white blood cell counts, blood cell composition and the percent of lymphocytes 
25 versus polymorphonuclear cells. A comparison of results for each dosing regime with 
the controls indicates if toxicity is present. 

At the termination of each toxicity study, farther studies can be carried out by 
sacrificing the animals (preferably, in accordance with the American Veterinary 
Medical Association guidelines Report of the American Veterinary Medical Assoc. 
30 Panel on Euthanasia, Journal of American Veterinary Medical Assoc., 202:229-249, 
1993). Representative animals from each treatment group can then be examined by 
gross necropsy for immediate evidence of metastasis, unusual illness or toxicity. 
Gross abnormalities in tissue are noted and tissues are examined histologically. 
Compounds causing a reduction in body weight or blood components are less 
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preferred, as are compounds having an adverse effect on major organs. In general, the 
greater the adverse effect the less preferred the compound. 

For the treatment of cancers the expected daily dose of a hydrophobic 
pharmaceutical agent is between 1 to 500 mg/day, preferably 1 to 250 mg/day, and 
5 most preferably 1 to 50 mg/day. Drugs can be delivered less frequently provided 

plasma levels of the active moiety are sufficient to maintain therapeutic effectiveness. 
Plasma levels should reflect the potency of the drug. Generally, the more potent the 
compound the lower the plasma levels necessary to achieve efficacy. 

nGPCR-x mRNA transcripts may found in many tissues, including, but not 

10 limited to, brain, peripheral blood lymphocytes, pancreas, ovary, uterus, testis, 

salivary gland, kidney, adrenal gland, liver, bone marrow, prostate, fetal liver, colon, 
muscle, and fetal brain, and may be found in many other tissues. Within the brain, 
nGPCR-x mRNA transcripts may be found in many tissues, including, but not limited 
to, frontal lobe, hypothalamus, pons, cerebellum, caudate nucleus, and medulla. 

15 Tissues and brain regions where specific nGPCR mRNA transcripts are expressed are 
identified in the Examples, below. 

Odd numbered nucleotide sequences ranging from SEQ ID NO: 1 to SEQ ID 
NO: 93 and SEQ ID NO: 185 will, as detailed above, enable screening the 
endogenous neurotransmitters/hormones/ligands which activate, agonize, or 

20 antagonize nGPCR-x and for compounds with potential utility in treating disorders 
including, but not limited to, thyroid disorders (e.g. thyreotoxicosis, myxoedema); 
renal failure; inflammatory conditions (e.g. y Chron's disease); diseases related to cell 
differentiation and homeostasis; rheumatoid arthritis; autoimmune disorders; 
movement disorders; CNS disorders (e.g., pain including migraine; stroke; psychotic 

25 and neurological disorders, including anxiety, schizophrenia, manic depression, 
anxiety, generalized anxiety disorder, post-traumatic-stress disorder, depression, 
bipolar disorder, delirium, dementia, severe mental retardation; dyskinesias, such as 
Huntington's disease or Tourette's Syndrome; attention disorders including ADD and 
ADHD, and degenerative disorders such as Parkinson's, Alzheimer's; movement 

30 disorders, including ataxias, supranuclear palsy, etc.); infections, such as viral 

infections caused by HIV-1 orHIV-2; metabolic and cardiovascular diseases and 
disorders (e.g., type 2 diabetes, obesity, anorexia, hypotension, hypertension, 
thrombosis, myocardial infarction, cardiomyopathies, atherosclerosis, etc.); 
proliferative diseases and cancers (e.g., different cancers such as breast, colon, lung, 



-\SOOOD <WO U'J647iA£ I > 



WO 01/36473 



PCT/LS0II/315SI 



etc., and hyperproliferative disorders such as psoriasis, prostate hyperplasia, etc.); 
hormonal disorders {e.g., male/female hormonal replacement, polycystic ovarian 
syndrome, alopecia, etc.); sexual dysfunction, among others. 

For example, nGPCR-x may be useful in the treatment of respiratory ailments 

5 such as asthma, where T cells are implicated by the disease. Contraction of airway 
smooth muscle is stimulated by thrombin. Cicala et al (1999) Br J Pharmacol 
126:478-484. Additionally, in bronchiolitis obliterans, it has been noted that 
activation of thrombin receptors may be deleterious. Hauck et a/.(1999) Am J Physiol 
277:L22-L29. Furthermore, mast cells have also been shown to have thrombin 

10 receptors. Cirino et al (1996) J Exp Med 183:821-827. nGPCR-x may also be useful 
in remodeling of airway structure s in chronic pulmonary inflammation via 
stimulation of fibroblast procollagen synthesis. See, e.g., Chambers et al. (1998) 
Biochem J 333:121-127; Trejo et al. (1996) J Biol Chem 271:21536-21541. 

In another example, increased release of sCD40L and expression of CD40L by 

15 T cells after activation of thrombin receptors suggests that nGPCR-x may be useful in 
the treatment of unstable angina due to the role of T cells and inflammation. See 
Aukrust et al. (1999) Circulation 100:614-620. 

A further example is the treatment of inflammatory diseases, such as psoriasis, 
inflammatory bowel disease, multiple sclerosis, rheumatoid arthritis, and thyroiditis. 

20 Due to the tissue expression profile of nGPCR-x, inhibition of thrombin receptors 
may be beneficial for these diseases. See, e.g., Morns et al. (1996) Ann Rheum Dis 
55:841-843. In addition to T cells, NK cells and monocytes are also critical cell types 
which contribute to the pathogenesis of these diseases. See, e.g., Naldini & Carney 
(1996) Cell Immunol 172:35-42; Hoffman & Cooper (1995) Blood Cells Mol Dis 

25 21:1 56- 1 67; Colotta et al. ( 1 994) Am J Pathol 1 44:975-985. 

Expression of nGPCR-x in bone marrow and spleen may suggest that it may 
play a role in the proliferation of hematopoietic progenitor cells. See DiCuccio et al. 
(1996) Exp Hematol 24:914-918. 

As another example, nGPCR-x may be useful in the treatment of acute and/or 

30 traumatic brain injury. Astrocytes have been demonstrated to express thrombin 

receptors. Activation of thrombin receptors may be involved in astrogliosis following 
brain injury. Therefore, inhibition of receptor activity may be beneficial for limiting 
neuroinflammation. Scar formation mediated by astrocytes may also be limited by 
inhibiting thrombin receptors. See, e g, Pindon et al. (1998) Eur J Biochem 255:766- 
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774; Ubl & Reiser. (1997) Glia 21:361-369; Grabham & Cunningham (1995) J 
Neurochem 64:583-591. 

nGPCR-x receptor activation may mediate neuronal and astrocyte apoptosis 
and prevention of neurite outgrowth. Inhibition would be beneficial in both chronic 
5 and acute brain injury. See, e.g., Donovan et al. (1997) J Neurosci 17:5316-5326; 
Turgeon et al (1998) J Neurosci 18:6882-6891; Smith-Swintosky et al. (1997) J 
Neurochem 69:1890-1896; Gill et al. (1998) Brain Res 797:321-327; Suidan et al. 
( 1 996) Semin Thromb Hemost 22:125-133. 

The attached Sequence Listing contains the sequences of the polynucleotides 
10 and polypeptides of the invention and is incorporated herein by reference in its 
entirety. 

As described above and in Example 4 below, the genes encoding nGPCR-1 
(nucleic acid sequence SEQ ID NO: 1, SEQ ID NO: 73, amino acid sequence SEQ ID 
NO: 2, SEQ ID NO:74), nGPCR-9 (nucleic acid sequence SEQ ID NO:9, SEQ ID 

15 NO:77, amino acid sequence SEQ ID NO:10, SEQ ID NO:78), nGPCR-1 1 (nucleic 
acid sequence SEQ ID NO: 1 1 , SEQ ID NO:79, amino acid sequence SEQ ID NO: 1 2, 
SEQ ID NO:80), nGPCR-1 6 (nucleic acid sequence SEQ ID NO: 21, SEQ ID NO:81, 
amino acid sequence SEQ ID NO: 22, SEQ ID NO:82), nGPCR-40 (nucleic acid 
sequence SEQ ID NO:53, SEQ ID NO:83, amino acid sequence SEQ ID NO:54, SEQ 

20 ID NO:84), nGPCR-54 (nucleic acid sequence SEQ ID NO:59, SEQ ID NO:85, 
amino acid sequence SEQ ID NO:60, SEQ ID NO: 86), nGPCR-56 (nucleic acid 
sequence SEQ ID NO:63, SEQ ID NO:87, SEQ ID NO:89, amino acid sequence SEQ 
ID NO:64, SEQ ID NO: 88, SEQ ID NO:90), nGPCR-58 (nucleic acid sequence SEQ 
ID NO:3, SEQ ID NO: 185, amino acid sequence SEQ ID NO:4, SEQ ID NO: 186) 

25 have been detected in brain tissue indicating that these n-GPCR-x proteins are 

neuroreceptors. The identification of modulators such as agonists and antagonists is 
therefore useful for the identification of compounds useful to treat neurological 
diseases and disorders. Such neurological diseases and disorders, including but are 
not limited to, schizophrenia, affective disorders, ADHD/ADD {i.e., Attention 

30 Deficit-Hyperactivity Disorder/Attention Deficit Disorder), and neural disorders such 
as Alzheimer's disease, Parkinson's disease, migraine, and senile dementia as well as 
depression, anxiety, bipolar disease, epilepsy, neuntis, neurasthenia, neuropathy, 
neuroses, and the like. 
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Methods of Screening Human Subjects 

Thus in yet another embodiment, the invention provides genetic screening 
procedures that entail analyzing a person's genome -- in particular their alleles for 
GPCRs of the invention — to determine whether the individual possesses a genetic 
5 characteristic found in other individuals that are considered to be afflicted with, or at 
risk for, developing a mental disorder or disease of the brain that is suspected of 
having a hereditary component. For example, in one embodiment, the invention 
provides a method for determining a potential for developing a disorder affecting the 
brain in a human subject comprising the steps of analyzing the coding sequence of 

10 one or more GPCR genes from the human subject; and determining development 
potential for the disorder in said human subject from the analyzing step. 

More particularly, the invention provides a method of screening a human 
subject to diagnose a disorder affecting the brain or genetic predisposition therefor, 
comprising the steps of: (a) assaying nucleic acid of a human subject to determine a 

15 presence or an absence of a mutation altering the amino acid sequence, expression, or 
biological activity of at least one seven transmembrane receptor that is expressed in 
the brain, wherein the seven transmembrane receptor comprises an amino acid 
sequence selected from the group consisting of SEQ ID NOS: 74, 186, 78, 80, 82, 84, 
86, 90, and 94, or an allelic variant thereof, and wherein the nucleic acid corresponds 

20 to the gene encoding the seven transmembrane receptor; and (b) diagnosing the 

disorder or predisposition from the presence or absence of said mutation, wherein the 
presence of a mutation altering the amino acid sequence, expression, or biological 
activity of allele in the nucleic acid correlates with an increased risk of developing the 
disorder. Ln preferred variations, the seven transmembrane receptor is nGPCR-40 or 

25 nGPCR-54 comprising amino acid sequences set forth in SEQ ID NO: 84 for nGPCR- 
40 and SEQ ID NO: 86 for nGPCR-54, or an allelic variant thereof, and the disease is 
schizophrenia. 

By "human subject" is meant any human being, human embryo, or human 
fetus. It will be apparent that methods of the present invention will be of particular 
30 interest to individuals that have themselves been diagnosed with a disorder affecting 
the brain or have relatives that have been diagnosed with a disorder affecting the 
brain. 

By "screening for an increased risk" is meant determination of whether a 
genetic variation exists in the human subject that correlates with a greater likelihood 
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of developing a disorder affecting the brain than exists for the human population as a 
whole, or for a relevant racial or ethnic human sub-population to which the individual 
belongs. Both positive and negative determinations (i.e., determinations that a genetic 
predisposition marker is present or is absent) are intended to fall within the scope of 
5 screening methods of the invention. In preferred embodiments, the presence of a 

mutation altering the sequence or expression of at least one nGPCR-40 or nGPCR-54 
seven transmembrane receptor allele in the nucleic acid is correlated with an increased 
risk of developing schizophrenia, whereas the absence of such a mutation is reported 
as a negative determination. 

10 The "assaying" step of the invention may involve any techniques available for 

analyzing nucleic acid to determine its characteristics, including but not limited to 
well-known techniques such as single-strand conformation polymorphism analysis 
(SSCP) [Orita et ai, Proc Natl. Acad. Set. USA, 86: 2766-2770 (1989)]; heteroduplex 
analysis [White et ai t Genomics, 1 2: 301-306 (1992)]; denaturing gradient gel 

15 electrophoresis analysis [Fischer et al , Proc. Natl. Acad. Sci. USA, 80: 1579-1583 
(1983 ); and Riesner et al., Electrophoresis, 10: 377-389 (1989)]; DNA sequencing; 
RNase cleavage [Myers et al., Science, 230: 1242-1246 (1985)]; chemical cleavage of 
mismatch techniques [Rowley et al., Genomics, 30: 574-582 (1995); and Roberts et 
al., Nuci Acids Res., 25: 3377-3378 (1997)]; restriction fragment length 

20 polymorphism analysis; single nucleotide primer extension analysis [Shumaker et ai, 
Hum. Mutat., 7: 346-354 (1996); and Pastinen et al.. Genome Res., 7: 606-614 
(1997)]; 5' nuclease assays [Pease et al., Proc. Natl. Acad. ScL USA, 9/:5022-5026 
(1994)]; DNA Microchip analysis [Ramsay, G., Nature Biotechnology, 16: 40-48 
(1999); and Chee et ai, U.S. Patent No. 5,837,832]; and Hgase chain reaction 

25 [Whiteley et al. , U.S. Patent No. 5,52 1 ,065]. [See generally, Schafer and Hawkins, 
Nature Biotechnology, 16: 33-39 (1998).] All of the foregoing documents are hereby 
incorporated by reference in their entirety. 

Thus, in one preferred embodiment involving screening nGPCR-40 or 
nGPCR-54 sequences, for example, the assaying step comprises at least one 

30 procedure selected from the group consisting of: (a) determining a nucleotide 

sequence of at least one codon of at least one nGPCR-40 or nGPCR-54 allele of the 
human subject; (b) performing a hybridization assay to determine whether nucleic 
acid from the human subject has a nucleotide sequence identical to or different from 
one or more reference sequences; (c) performing a polynucleotide migration assay to 
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determine whether nucleic acid from the human subject has a nucleotide sequence 
identical to or different from one or more reference sequences; and (d) performing a 
restriction endonuclease digestion to determine whether nucleic acid from the human 
subject has a nucleotide sequence identical to or different from one or more reference 
sequences. 

In a highly preferred embodiment, the assaying involves sequencing of nucleic 
acid to determine nucleotide sequence thereof, using any available sequencing 
technique. [See, e.g., Sanger et al, Proc. Natl. Acad. Sci. (USA), 74: 5463-5467 
(1977) (dideoxy chain termination method); Mirzabekov, TIB TECH, 12: 27-32 (1994) 
(sequencing by hybridization); Drmanac et al. Nature Biotechnology, 16: 54-58 
(1998); U.S. Patent No. 5,202,23 1 ; and Science, 260: 1649-1652 (1993) (sequencing 
by hybridization); Kieleczawa et ai, Science, 258: 1787-1791 (1992) (sequencing by 
primer walking); (Douglas et al, Biotechniques, 14: 824-828 (1993) (Direct 
sequencing of PCR products); and Akane et al, Biotechniques 16: 238-241 (1994); 
Maxam and Gilbert, Meth. Enzymol, 65: 499-560 (1977) (chemical termination 
sequencing), all incorporated herein by reference.] The analysis may entail 
sequencing of the entire nGPCR gene genomic DNA sequence, or portions thereof; or 
sequencing of the entire seven transmembrane receptor coding sequence or portions 
thereof. In some circumstances, the analysis may involve a determination of whether 
an individual possesses a particular allelic variant, in which case sequencing of only a 
small portion of nucleic acid -- enough to determine the sequence of a particular 
codon characterizing the allelic variant - is sufficient. This approach is appropriate, 
for example, when assaying to determine whether one family member inherited the 
same allelic variant that has been previously characterized for another family member, 
or, more generally, whether a person's genome contains an allelic variant that has 
been previously characterized and correlated with a mental disorder having a heritable 
component. 

In another highly preferred embodiment, the assaying step comprises 
performing a hybridization assay to determine whether nucleic acid from the human 
subject has a nucleotide sequence identical to or different from one or more reference 
sequences. In a preferred embodiment, the hybridization involves a determination of 
whether nucleic acid derived from the human subject will hybridize with one or more 
oligonucleotides, wherein the oligonucleotides have nucleotide sequences that 
correspond identically to a portion of the GPCR gene sequence taught herein, such as 
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the nGPCR-40 or nGPCR-54 coding sequence set forth in SEQ ID NOS: 83 for 
nGPCR-40 or 85 for nGPCR-54, or that correspond identically except for one 
mismatch. The hybridization conditions are selected to differentiate between perfect 
sequence complementarity and imperfect matches differing by one or more bases. 
5 Such hybridization experiments thereby can provide single nucleotide polymorphism 
sequence information about the nucleic acid from the human subject, by virtue of 
knowing the sequences of the oligonucleotides used in the experiments. 

Several of the techniques outlined above involve an analysis wherein one 
performs a polynucleotide migration assay, e.g., on a polyacryl amide electrophoresis 

10 gel (or in a capillary electrophoresis system), under denaturing or non-denaturing 
conditions. Nucleic acid derived from the human subject is subjected to gel 
electrophoresis, usually adjacent to (or co-loaded with) one or more reference nucleic 
acids, such as reference GPCR-encoding sequences having a coding sequence 
identical to all or a portion of SEQ ID NOS: 83 or 85 (or identical except for one 

15 known polymorphism). The nucleic acid from the human subject and the reference 
sequence(s) are subjected to similar chemical or enzymatic treatments and then 
electrophoresed under conditions whereby the polynucleotides will show a differential 
migration pattern, unless they contain identical sequences. [See generally Ausubel et 
al. (eds.), Current Protocols in Molecular Biology, New York: John Wiley & Sons, 

20 Inc. (1987-1999); and Sambrook et al., (eds.), Molecular Cloning, A Laboratory 
Manual, Cold Spring Harbor, New York: Cold Spring Harbor Laboratory Press 
(1989), both incorporated herein by reference in their entirety.] 

In the context of assaying, the term "nucleic acid of a human subject'* is 
intended to include nucleic acid obtained directly from the human subject {e.g., DNA 

25 or RNA obtained from a biological sample such as a blood, tissue, or other cell or 

fluid sample); and also nucleic acid derived from nucleic acid obtained directly from 
the human subject. By way of non-limiting examples, well known procedures exist 
for creating cDNA that is complementary to RNA derived from a biological sample 
from a human subject, and for amplifying (e.g., via polymerase chain reaction (PCR)) 

30 DNA or RNA derived from a biological sample obtained from a human subject. Any 
such derived polynucleotide which retains relevant nucleotide sequence information 
of the human subject's own DNA/RNA is intended to fall within the definition of 
"nucleic acid of a human subject" for the purposes of the present invention. 
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In the context of assaying, the term "mutation" includes addition, deletion, 
and/or substitution of one or more nucleotides in the GPCR gene sequence (e.g., as 
compared to the seven transmembrane receptor-encoding sequences set forth of SEQ 
ID NOS: 74, 186, 78, 80, 82, 84, 86, 90, and 94) and other polymorphisms that occur 
in introns (where introns exist) and that are identifiable via sequencing, restriction 
fragment length polymorphism, or other techniques. The various activity examples 
provided herein permit determination of whether a mutation modulates activity of the 
relevant receptor in the presence or absence of various test substances. 

In a related embodiment, the invention provides methods of screening a 
person's genotype with respect to GPCR's of the invention, and correlating such 
genotypes with diagnoses for disease or with predisposition for disease (for genetic 
counseling). For example, the invention provides a method of screening for an 
nGPCR-40 or nGPCR-54 hereditary schizophrenia genotype in a human patient, 
comprising the steps of: (a) providing a biological sample comprising nucleic acid 
from the patient, the nucleic acid including sequences corresponding to said patient's 
nGPCR-40 or nGPCR-54 alleles; (b) analyzing the nucleic acid for the presence of a 
mutation or mutations; (c) determining an nGPCR-40 or nGPCR-54 genotype from 
the analyzing step; and (d) correlating the presence of a mutation in an nGPCR-40 or 
nGPCR-54 allele with a hereditary schizophrenia genotype. In a preferred 
embodiment, the biological sample is a cell sample containing human cells that 
contain genomic DNA of the human subject. The analyzing can be performed 
analogously to the assaying described in preceding paragraphs. For example, the 
analyzing comprises sequencing a portion of the nucleic acid (e.g., DNA or RNA), the 
portion comprising at least one codon of the nGPCR-40 or nGPCR-54 alleles. 

Although more time consuming and expensive than methods involving nucleic 
acid analysis, the invention also may be practiced by assaying protein of a human 
subject to determine the presence or absence of an amino acid sequence variation in 
GPCR protein from the human subject. Such protein analyses may be performed, 
e g y by fragmenting GPCR protein via chemical or enzymatic methods and 
sequencing the resultant peptides; or by Western analyses using an antibody having 
specificity for a particular allelic variant of the GPCR. 

The invention also provides materials that are useful for performing methods 
of the invention. For example, the present invention provides oligonucleotides useful 
as probes in the many analyzing techniques described above. In general, such 
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oligonucleotide probes comprise 6, 7, 8, 9, 10, 11, 12, 13, 14, 15, 16, 17, 18, 19, 20, 
21, 22, 23, 24, 25, 26, 27, 28, 29, 30, 31, 32, 33, 34, 35, 36, 37, 38, 39, 40, 41, 42, 43, 
44, 45, 46, 47, 48, 49, or 50 nucleotides that have a sequence that is identical, or 
exactly complementary, to a portion of a human GPCR gene sequence taught herein 
5 (or allelic variant thereof), or that is identical or exactly complementary except for 
one nucleotide substitution. In a preferred embodiment, the oligonucleotides have a 
sequence that corresponds in the foregoing manner to a human GPCR coding 
sequence taught herein, and in particular, the coding sequences set forth in SEQ ED 
NO: 83 and 85. In one variation, an oligonucleotide probe of the invention is purified 
10 and isolated. In another variation, the oligonucleotide probe is labeled, e.g., with a 
radioisotope, chromophore, or fluorophore. In yet another variation, the probe is 
covalently attached to a solid support. [See generally Ausubel et at. And Sambrook et 
al., supra.] 



15 are useful for practicing methods of the invention. For example, the invention 

provides a kit for screening a human subject to diagnose schizophrenia or a genetic 
predisposition therefor, comprising, in association: (a) an oligonucleotide useful as a 
probe for identifying polymorphisms in a human nGPCR-40 or nGPCR-54 seven 
transmembrane receptor gene, the oligonucleotide comprising 6-50 nucleotides that 

20 have a sequence that is identical or exactly complementary to a portion of a human 
nGPCR-40 or nGPCR-54 gene sequence or nGPCR-40 or nGPCR-54 coding 
sequence, except for one sequence difference selected from the group consisting of a 
nucleotide addition, a nucleotide deletion, or nucleotide substitution; and (b) a media 
packaged with the oligonucleotide containing information identifying polymorphisms 

25 identifyable with the probe that correlate with schizophrenia or a genetic 

predisposition therefor. Exemplary information-containing media include printed 
paper package inserts or packaging labels; and magnetic and optical storage media 
that are readable by computers or machines used by practitioners who perform genetic 
screening and counseling services. The practitioner uses the information provided in 

30 the media to correlate the results of the analysis with the oligonucleotide with a 
diagnosis. In a preferred variation, the oligonucleotide is labeled. 

In still another embodiment, the invention provides methods of identifying 
those allelic variants of GPCRs of the invention that correlate with mental disorders. 
For example, the invention provides a method of identifying a seven transmembrane 



In a related embodiment, the invention provides kits comprising reagents that 
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allelic variant that correlates with a mental disorder, comprising steps of: (a) 
providing a biological sample comprising nucleic acid from a human patient 
diagnosed with a mental disorder, or from the patient's genetic progenitors or 
progeny; (b) analyzing the nucleic acid for the presence of a mutation or mutations in 
5 at least one seven transmembrane receptor that is expressed in the brain, wherein the 
at least one seven transmembrane receptor comprises an amino acid sequence selected 
from the group consisting of SEQ ID NOS: 74, 186, 78, 80, 82, 84, 86, 90, and 94 or 
an allelic variant thereof, and wherein the nucleic acid includes sequence 
corresponding to the gene or genes encoding the at least one seven transmembrane 

10 receptor; (c) determining a genotype for the patient for the at least one seven 
transmembrane receptor from said analyzing step; and (d) identifying an allelic 
variant that correlates with the mental disorder from the determining step. To 
expedite this process, it may be desirable to perform linkage studies in the patients 
(and possibly their families) to correlate chromosomal markers with disease states. 

15 The chromosomal localization data provided herein facilitates identifying an involved 
GPCR with a chromosomal marker. 

The foregoing method can be performed to correlate GPCR's of the invention 
to a number of disorders having hereditary components that are causative or that 
predispose persons to the disorder. For example, in one preferred variation, the 

20 disorder is schizophrenia, and the at least one seven transmembrane receptor 

comprises nGPCR-40 having an amino acid sequence set forth in SEQ ID NO: 84 or 
an allelic variant thereof. 

Also contemplated as part of the invention are polynucleotides that comprise 
the allelic variant sequences identified by such methods, and polypeptides encoded by 

25 the allelic variant sequences, and oligonucleotide and oligopeptide fragments therof 
that embody the mutations that have been identified. Such materials are useful in in 
vitro cell-free and cell-based assays for identifying lead compounds and therapeutics 
for treatment of the disorders. For example, the variants are used in activity assays, 
binding assays, and assays to screen for activity modulators described herein. In one 

30 preferred embodiment, the invention provides a purified and isolated polynucleotide 
comprising a nucleotide sequence encoding a nGPCR-40 or nGPCR-54 receptor 
allelic variant identified according to the methods described above; and an 
oligonucleotide that comprises the sequences that differentiate the allelic variant from 
the nGPCR-40 or nGPCR-54 sequences set forth in SEQ ID NOS: 83 and 88. The 
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invention also provides a vector comprising the polynucleotide (preferably an 
expression vector); and a host cell transformed or transfected with the polynucleotide 
or vector. The invention also provides an isolated cell line that is expressing the 
allelic variant GPCR polypeptide; purified cell membranes from such cells; purified 
5 polypeptide; and synthetic peptides that embody the allelic variation amino acid 
sequence. In one particular embodiment, the invention provides a purified 
polynucleotide comprising a nucleotide sequence encoding a nGPCR-40 seven 
transmembrane receptor protein of a human that is affected with schizophrenia; 
wherein said polynucleotide hybridizes to the complement of SEQ ID NO: 83 under 

10 the following hybridization conditions: (a) hybridization for 16 hours at 42 °C in a 

hybridization solution comprising 50% formamide, 1% SDS, 1 M NaCl, 10% dextran 
sulfate and (b) washing 2 times for 30 minutes at 60 °C in a wash solution comprising 
O.lx SSC and 1% SDS; and wherein the polynucleotide encodes a nGPCR-40 amino 
acid sequence that differs from SEQ ED NO: 84 by at least one residue. 

15 An examplary assay for using the allelic variants is a method for identifying a 

modulator of nGPCR-x biological activity, comprising the steps of: (a) contacting a 
cell expressing the allelic variant in the presence and in the absence of a putative 
modulator compound; (b) measuring nGPCR-x biological activity in the cell; and (c) 
identifying a putative modulator compound in view of decreased or increased 

20 nGPCR-x biological activity in the presence versus absence of the putative modulator. 
Additional features of the invention will be apparent from the following 
Examples. Examples 1, 2, 4, 11, 12, and 13 are actual, while the remaining Examples 
are prophetic. Additional features and variations of the invention will be apparent to 
those skilled in the art from the entirety of this application, including the detailed 

25 description, and all such features are intended as aspects of the invention. Likewise, 
features of the invention described herein can be re-combined into additional 
embodiments that also are intended as aspects of the invention, irrespective of 
whether the combination of features is specifically mentioned above as an aspect or 
embodiment of the invention. Also, only such limitations which are described herein 

30 as critical to the invention should be viewed as such; variations of the invention 

lacking limitations which have not been described herein as critical are intended as 
aspects of the invention. 
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EXAMPLES 

EXAMPLE 1 : IDENTIFICATION OF nGPCR-X 

A. Database search 

The Celera database was searched using known GPCR receptors as query 
sequences to find patterns suggestive of novel G protein-coupled receptors. Positive 
hits were further analyzed with the GCG program BLAST to determine which ones 
were the most likely candidates to encode G protein-coupled receptors, using the 
standard (default) alignment produced by BLAST as a guide. 

Briefly, the BLAST algorithm, which stands for Basic Local Alignment 
Search Tool is suitable for determining sequence similarity (Altschul et al, J. Molec. 
Biol., 1990, 215, 403-410, which is incorporated herein by reference in its entirety). 
Software for performing BLAST analyses is publicly available through the National 
Center for Biotechnology Information (http://www.ncbi.nlm.nih.gov/). This 
algorithm involves first identifying high scoring sequence pair (HSPs) by identifying 
short words of length W in the query sequence that either match or satisfy some 
positive-valued threshold score T when aligned with a word of the same length in a 
database sequence. T is referred to as the neighborhood word score threshold 
(Altschul et aL, supra). These initial neighborhood word hits act as seeds for 
initiating searches to find HSPs containing them. The word hits are extended in both 
directions along each sequence for as far as the cumulative alignment score can be 
increased. Extension for the word hits in each direction are halted when: 1) the 
cumulative alignment score falls off by the quantity X from its maximum achieved 
value; 2) the cumulative score goes to zero or below, due to the accumulation of one 
or more negative-sconng residue alignments; or 3) the end of either sequence is 
reached. The Blast algorithm parameters W, T and X determine the sensitivity and 
speed of the alignment. The Blast program uses as defaults a word length (W) of 1 1, 
the BLOSUM62 scoring matrix (see Henikoff et aL, Proc. Natl. Acad. Sci. USA, 
1992, 89, 10915-10919, which is incorporated herein by reference in its entirety) 
alignments (B) of 50, expectation (E) of 10, M=5, N=4, and a comparison of both 
strands. 

The BLAST algorithm (Karlin et aL, Proc. Natl. Acad. Sci. USA, 1993, 90, 
5873-5787, which is incorporated herein by reference in its entirety) and Gapped 
BLAST perform a statistical analysis of the similarity between two sequences. One 
measure of similarity provided by the BLAST algorithm is the smallest sum 
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probability (P(N)), which provides an indication of the probability by which a match 
between two nucleotide or amino acid sequences would occur by chance. For 
example, a nucleic acid is considered similar to a GPCR gene or cDNA if the smallest 
sum probability in comparison of the test nucleic acid to a GPCR nucleic acid is less 
than about 1, preferably less than about 0.1, more preferably less than about 0.01, and 
most preferably less than about 0.001 . 

Homology searches were performed with the program BLAST version 2.08. A 
collection of 340 query amino acid sequences derived from GPCR's was used to 
search the genomic DNA sequence using TBLASTN and alignments with an E-value 
lower than 0.01 were collected from each BLAST search. The amino acid sequences 
have been edited to remove regions in the sequence that produce non-significant 
alignments with proteins that are not related to GPCR's. 

Multiple query sequences may have a significant alignment to the same 
genomic region, although each alignment may not cover exactly the same DNA 
region. A procedure is used to determine the region of maximum common overlap 
between the alignments from several query sequences. This region is called the 
consensus DNA region. The procedure for determining this consensus involves the 
automatic parsing of the BLAST output files using the program MSPcrunch to 
produce a tabular report. From this tabular report the start and end of each alignment 
in the genomic DNA is extracted. This information was used by a PERL script to 
derive the maximum common overlap. These regions were reported in the form of a 
unique sequence identifier, a start and the end position in the sequence. The 
sequences defined by these regions were extracted from the original genomic 
sequence file using the program fetchdb. 

The consensus regions were assembled into a non-redundant set by using the 
program phrap. After assembly with phrap a set of contigs and singletons was 
defined as candidate DNA regions coding for nGPCR-x. These sequences were then 
submitted for further sequence analysis. 

Further sequence analysis involved the removal of sequences previously 
isolated and removal of sequences related to olfactory GPCRs. The transmembrane 
regions for the sequences that remained were determined using a FORTRAN 
computer program called "tmtrest.aH" [Parodi et al., Comput.Appl.Biosci. 5:527- 
535(1994)]. Only sequences that contained transmembrane regions in a pattern found 
in GPCRs were retained. 
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cDNAs were sequenced directly using an ABI377 fluorescence-based 
sequencer (Perkin-Elmer/ Applied Biosystems Division, PE/ABD, Foster City, CA) 
and the ABI PRISM™ Ready Dye-Deoxy Terminator kit with Taq FS™ polymerase. 
Each ABI cycle sequencing reaction contained about 0.5 ptg of plasmid DNA. 
Cycle-sequencing was performed using an initial denaturation at 98°C for 1 minute, 
followed by 50 cycles using the following parameters: 98°C for 30 seconds, annealing 
at 50°C for 30 seconds, and extension at 60°C for 4 minutes. Temperature cycles and 
times were controlled by a Perkin-Elmer 9600 thermocycler. Extension products 
were purified using Centriflex™ gel filtration cartridges (Advanced Genetic 
Technologies Corp., Gaithersburg, MD). Each reaction product was loaded by pipette 
onto the column, which is then centrifuged in a swinging bucket centrifuge (Sorvall 
model RT6000B tabletop centrifuge) at 1500 x g for 4 minutes at room temperature. 
Column-purified samples were dried under vacuum for about 40 minutes and then 
dissolved in 5 /xl of a DNA loading solution (83% deionized formamide, 8.3 mM 
EDTA, and 1 .6 mg/ml Blue Dextran). The samples were then heated to 90°C for 
three minutes and loaded into the gel sample wells for sequence analysis using the 
ABI377 sequencer. Sequence analysis was performed by importing ABI377 files into 
the Sequencer program (Gene Codes, Ann Arbor, MI). Generally, sequence reads of 
700 bp were obtained. Potential sequencing errors were minimized by obtaining 
sequence information from both DNA strands and by re-sequencing difficult areas 
using primers annealing at different locations until all sequencing ambiguities were 
removed. 

The following Table 5 contains the sequences of the polynucleotides and 
polypeptides of the invention. Start and stop codons within the polynucleotide 
sequence are identified by boldface type. The transmembrane domains within the 
polypeptide sequence are identified by underlining. 
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Table 5 



The following DNA sequence beGPCR-aeql <SEQ ID NO. l> was identified in 
H. sapiens: 

GTCTGGGGGTGGGGGATGCTGGGACAGGGGTCAATTGCCTGAAGCAAGTGCTCTCATCCCCCTAGCTCCTGC 
TGATCTAGTTGGGGCTCCAGAGTGGGGAGGAGAAAGGCACTTTGAAACTTCTCTGCCCTTACCGTCTTAGCC 
ATCAAACTCTGAGCTGGAGATAGTGACGATGTGACAGGAACTTTCCCTGGGCCTCTCTGGGCCACAATTCCT 
GGCCGAGAGAAAGAGGAGGAATGAGGTGAGCACCTTCTTCACTCCTAGGGCCATGTGGTAGAGCTGCAGTCG 
CACCTCCTTCTGCCAATAGGCATAGATGAGTGGGTTGAGGAGGGAGTTGCCCACGCCGAGCAGCCACAGGTA 
CCGTTCCAGCACTAGGTAGAGGTGACACTCCTGGCAGGCCACCTGCACAATGCCAGTGATAAGGAAGGGGGT 
CCAGGATAGAGCAAAGCTCCCAATGAGAACAGACACAGTACGGAGAGCTTTGAAGTCGCTGGGAGTCCGTGG 
GGATCGATAACCTCCAGCCATGGCTCCTGCATGTTCCATCTTTCGAATCTGCTGGCTGTGCATGGAGGCAAT 
TTGAGCATGTCGCAGTAGAAGAAGACAAAGAGGAGCATGGCTGGGAAGAAGCCAACGCAGGAGAGGGTCAGC 
ACGAAGTGAGGGTGAAATACAGCAAAGAAGCTGCACTGCCCTTTGTAGGCAGTCTGCTGGAACATGGGGATT 
CCGAGTGGGAGGAAGCCAATGAGGTAAGACACTAACCACAGCCCGGCAATGCAGGCCCCGGCCACGAACCCA 
CTCATGATCTTCAAGTAGCGGAAGGGCTGCTTGATGGCAAGGTACCTGTCAAAGGTGATCAGCATGACCGTG 
AGGACAGAGGCAGCTGCGGAGGAAGTGACAAATGCCATCCGCAGGCTGCACAGGGTCTTCTGTGTGGGCCGA 
GAAGGGCTGGAGAGCTGGTCTGTGAGTAGGCCAGAGATGGCCACACCAATCAAGGTGTCAGCCACAGCCAGA 
TTCAAGGTGAAGCAGAGAGTGACACCATCATTCTTGTGGATCAACAGCAGCACAGCCACAGCCACTAGTGTG 
TTAGTAGCAATGATGAGGGAGGCCAGGACAGCAAGGATCACTCCAAATGAGAAAGATGATTCCATGTCTCGA 
AGTGGCAGGACTTCACTTACCAGGGCATG 

The following amino acid sequence <SEQ ID NO. 2> is the predicted amino 
acid sequence derived from the DNA sequence of SEQ ID NO. 1: 

MESSF SFGVILAVLASLIIATNTLVAVAVL LLIHKNDGV SLCFTLNLAVADTLIGVAISGLLT DQLSSPSRPT 
QKTLCSLR MAFVTS5AAASVLTVMLIT FDRYLAIKQPFRYLKIMSG FVAGACIAGLWLVSYLIGFLPLGIP MF 
QQTAYKGQCSFFAVFHP HFVLTLSCVGFFPAWLLFVFFYCDM LKIASMHSQ0IRKMEHAGAMAGGYRSPRTP5? 
DFKALRT V5VLIGSFALSWTPFLITGIVQ VACQECHLYLVLER YLWLLGVGNSLLNPLIYAYW QKEVRLQLYH 
MALGVKKVLTSFLLFLSARNCGPERPRESSCHI VTISSSEFDG 



The following DNA sequence beGPCR - seq3 < SEQ ID NO. 3> was identified in H . 



CAGCGCGAGCGCCTTCATGGTGACGGTGTCCATGCGCTGGCAGTGTCTGCGTGCCACCCGGTGCACCTGGAG 

CGAGGTGAGGCAGAGCACCGCCAGCGGCAGCACGAAGCCCACGGCATGGAGCGTGGCGGTGAAGGCTGCGAA 

GCGCGGACGCTCAGGCTCGGGCGGCAGGCGCAGCGAACAGGACGCGAAGGCGCTGCTGTAGCCAAGCCACGA 

GCAGCCAAGTGCAGCGCCTGAGAAGGCCAGCGACTGTCCCCAGGCACAGCCCAGCAGCAGGCCGGCATAGCG 

CGGTCGCAGGCGTCCGGCGTAGCGCAGTGGGAAGCCCACTGCCAGCCACTGGTCTGCGCTCAGCGCCGCCAC 

GCTCAGCGCCGCGTTGGACGCCAGGAAGGTGTCCAGGAAGCCAATGACTTGGCATGCGCCGGGCGCCGACGG 

TGTCCGCCCGCGCATCACACCGAGCAGCGTGAAGGGCATGTCCAGCGCCGCCAGCAGCAGGTGGCCCAGAGA 

CAGATTCACCAGGAGGACGCCTGAGGCTCGAGTGCGGAGCTCAGCGCTGTAGGCGCAACAAAGCAGCACCAG 

TGCGTTGGATAGCAGCGCCACGGCCAGTACCATCACCAGGAGACCCGCCAGCAGCGCCTCGCCGGGGCCCAT 
GGCGCTAGC 

The following amino acid sequence <SEQ ID NO. 4> is the predicted amino 
acid sequence derived from the DNA sequence of SEQ ID NO. 3: 

S AMGPGEALL AGLLVMVLAVAIjLSNAXiVLLC CAYSAEL 

TPSAPGACQ VIGFLDTFLASNAALSVAAX SADQWLAVGFPLRYAGRLRPRYA GLLLGCAWGQSLAFSGAALG C 
SWLGYSSAFASCSLRLPPEPERPRFA AFTATLHAVGFVLPLAVLCLTSLQ VHRVARRHCQRMDTVTMKALA 



The following DNA sequence beGPCR-seq4 <SEQ ID NO. 5> was identified in 
H. sapiens: 

TGTGCAGGTGTGATCTCCATTGCTTTGTACATCCCTCACACGCTGTTCGATGGGATTTTGGAAAGGAAATCT 

GTGTATTTTGGCTCACTACTGACTATCTGTTATGTACAGCATCTGTATATAACATTGTCCTCATCAGCTATG 

ATCGATACCTGTCAGTCTCAAATGCTGTAAGTCGAACACATTAATTTATCCCCCTTAGAAGATTATGTAAAT 
GTATA 

The following amino acid sequence <SEQ id NO. 6> is the predicted amino 
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acid sequence derived from the DNA sequence of SEQ ID NO. 5: 
PAr;VTSIPLYIPHTLFEWDFGKEIC VFWLTTDYLLCTASVYNi yL_ISYDRYLSV5NAVSRTHFIPLR 



The following DNA sequence beGPCR-seq5 <SEQ ID NO. 7> was identified in 
H. sapiens-. 

GACGTCGAAGCAGGTGATGATGCCCAGGGCGTGCACCGGGTAGGTGAGATCGGTGCGCGCCAGCGGGGACAGG 
GCGGTCAGGAGCAGCAGCCAGGTCCCTGCACACGCGGCCACCGCGTAACGACGGCGGCGCCAGCGCTTGGAGC 
TGAGCGGGTACAGGATCCCCAGGAAGCGCTCCACGCTGATACAGGTCATGGTGAGGATGCTGGAATACATGTT 
TGCGTAAAAGGCCACGGTCACCACGTTGCAAAGCAGCACCCCGAATACCCAGTGGTGGCGGTTGCAATGGTAG 
TAGATTTGGAAAGGCAACACGCTGGCCAGCATCAGGTCCGTGACGCTCAGGTTGATCATGAAGATGACCGACG 
GGGATCTGGGCCCCATGCGCCGGCACAGCACCCACAGAGAGAAGAGGTTGCCCGGGATGCTGACCGCCGCCAC 
CAGCGAGTACACCACGGGCAGGGCCACCGCGATCGCCGGGTTCCGCAGCATCTGCAGCGTCGCGTTGTC 

The following amino acid sequence <SEQ ID NO. 8> is the predicted amino 
acid sequence derived from the DNA sequence of SEQ ID NO. 7: 

PNATTQMT,PNPZiTavaT,P\AA^ST l VAAVSIPGNLFSLVAALCRRMGPRSPS VIFMINLSVTDLMLASVLPF QIYY 
HCNRHHWV FG VLCNLVVTVAFYANMYS S I LTMTC I S V ERFLG ILYPLSS KRWRRRR Y AVAACAGTWLLLLTAL 
S PLARTDLTYP VHALG 1 1 TCFDV 



The following DNA sequence beGPCR-seq9<SEQ ID NO. 9> was identified in 



CCCATGTTCCTGCTCCTGGGCAGCCTCACGTTGTCGGATCTGCTGGCAGGCGCCGCCTACGCCGCCAACAT 
CCTACTGTCGGGGCCGCTCACGCTGAAACTGTCCCCCGCGCTCTGGTTCGCACGGGAGGGAGGCGTCTTCG 
TGGCACTCACTGCGTCCGTGCTGAGCCTCCTGGGCATCGCGCTGGAGCGCAGCCTCACCATGGCGCGCAGG 
GGGCCCGCGCCCGTCTCCAGTCGGGGGCGCACGCTGGCGATGGCAGCCGCGGCCTGG 

The following amino acid sequence <SEQ ID NO. 10> is the predicted amino 
acid sequence derived from the DNA sequence of SEQ ID NO . 9: 

P MFLLLGSLTLSDLLAGAAYAANILL SGPLTLKLSPALWFAR EGGVFVALTASVLSLLGIAL ERSLTMARRGP 
APVSSRGRTLAMAAAAW 

The following DNA sequence beGPCR-seqll <SEQ ID NO. 11> was identified in 
H. sapiens: 

CTGCTCATTGTGGCCTTTGTGCTGGGCGCACTAGGCAATGGGGTCGCCCTGTGTGGTTTCTGCTTCCACAT 
GAAGACCTGGAAGCCCAGCACTGTTTACCTTTTCAATTTGGCCGTGGCTGATTTCCTCCTTATGATCTGCC 
TGCCTTTTCGGACAGACTATTACCTCAGACGTAGACACTGGGCTTTTGGGGACATTCCCTGCCGAGTGGGG 
CTCTTCACGTTGGCCATGAACAGGGCCGGGAGCATCGTGTTCCTTACGGTGGTGGCTGCGGACAGGTATTT 
CAAAGTGGTCCACCCCCACCACGCGGTGAACACTATCTCCACCCGGGTGGCGGCTGGCATCGTCTGCACCC 
TGTGGGCCCTGGTCATCCTGGGAACAGTGTATCTTTTGCTGGAGAACCATCTCTGCGTGCAAGAGACGGCC 
GTCTCCTGTGAGAGCTTCATCATGGAGTCGGCCAATGGCTGGCATGACATCATGTTCCAGCTGGAGTTCTT 
TATGCCCCTCGGCATCATCTTATTTTGCTCCTTCAAGATTGTTTGGAGCCTGAGGCGGAGGCAGCAGCTGG 
CCAGACAGGCTCGGATGAAGAAGGCGACCCGGTTCATCATGGTGGTGGCAATTGTGTTCATCACATGCTAC 
CTGCCCAGCGTGTCTGCTAGACTCTATTTCCTCTGGACGGTGCCCTCGAGTGCCTGCGATCCCTCTGTCCA 
TGGGGCCCTGCACATAACCCTCAGCTTCACCTACATGAACAGCATGCTGGATCCCCTGGTGTATTATTTTT 
CAAGCCCCTCCTTTCCCAAATTCTACAACAAGCTCAAAATCTGCAGTCTGAAACCCAAGCAGCCAGGACAC 
TCAAAAACACAAAGGCCGGAAGAGATGCCAATTTCG 

The following amino acid sequence <SEQ ID NO. 12> is the predicted amino 
acid sequence derived from the DNA sequence of SEQ ID NO. 11: 

TJ.TVAFVT.GALGNGVALCGFCFHMKTWKPST VYLFNLAVADFLLMICLP FRTDYYLRRRHWAFGDIPCRVGLF 
TLAMNRAGS IVFLTWA ADRYFKWHPHHAWTISJW 

SFIMESANGWHDIMFQLEFFMPLGIILFCSFKIVWSLRRRQQLARQARMKKATRFIMWAIVFITCYLPSVSA 



RLCKCI 



sapiens : 
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RLYFLWTVPSSACDPSVHG ALHITLSFTYMNSMLDPLVYYF SSPSFPKFYNKLKICSLKPKQPGHSKTORPFP 
MPIS 

The following DNA sequence beGPCR - seql2 <SEQ ID NO. I3> was identified in 
H. sapiens : 

TGGAGCTGTGCCACCACCTATCTGGTGAACCTGATGGTGGCCGACCTGCTTTATGTGCTATTGCCCTTCCT 
CATCATCACCTACTCACTAGATGACAGGTGGCCCTTCGGGGAGCTGCTCTGCAAGCTGGTGCACTTCCTGT 
TCTATATCAACCTTTACGGCAGCATCCTGCTGCTGACCTGCATCTCTGTGCACCAGTTCCTAGGTGTGTGC 
CACCCACTGTGTTCGCTGCCCTACCGGACCCGCAGGCATGCCTGGCTGGGCACCAGCACCACCTGGGCCCT 
GGTGGTCCTCCAGCTGCTGCCCACACTGGCCTTCTCCCACACGGACTACATCAATGGCCAGATGATCTGGT 
ATGACATGACCAGCCAAGAGAATTTTGATCGGCTTTTTGCCTACGGCATAGTTCTGACATTGTCTGGCTTT 
CTTTCCCTCCTTGGTCATTTTGGTGTGCTATTCACTGATGGTCAGGAGCCTGATCAAGCCAGAGGAGAACC 
TCATGAGGACAGG 

I The following amino acid sequence <SEQ ID NO. 14 > is the predicted amino 
acid sequence derived from the DNA sequence of SEQ ID NO - 13: 

WSCATT YLVNLMVADLLYVLLPFLII TYSLDDRWPFGELLC KLVHFLFYINLYGSILLLTCISVHQF LGVCHP 
LCSLPYRTRRHAW LGTSTTWAljVVLQLLPTLAF SHTDYINGQKIWYDMTSQENFDR LFAYGIVLTLSGFLSLL 
GHFGVLF TDGQEPDQARGEPHEDR 



The following DNA sequence beGPCR - seql4 < SEQ ID NO. 15> was identified in 
H. sapiens: 

CCACCACGCGCAGCACGCCGACAGGGCCTCTCCCTCCCATTCTCCCGCAGGCCCGGACGACCACGCTGCCT 
CCAGCCGGTCGGCAAACTAGGGCAGCTCGCAGCCCACGAACAGCAGCCCCAGCAGCTGGCTCATCTTCAGG 
CTCTGCACCTTGGCGCGGGGCATCGCGCTGGGCGCACGGGCTCCACCTGGGCTCGCCGACCAGGCCGCTGC 
ACCCGCTGGGGCCTTCAGCCGGTGCCGCCACCAGACGGAGAGTAGGTGGCCACAAGCGACACCCATGATCT 
TAACAGGCGCGACGAAGCCCGCGACGGCCTCATAGAACGCGTACACCTGCACGTGCCAGCGCTGCAGGAGC 
GCGAAGATCCAGTGGCAGCGACGCATCCCCGGCCAGGCTCGGGCGGAGAGTGGCGCGCCTGGCTGCAGAGA 
CGTTNNNNNNNNNNNNltfNNNNNNT 

GCAGCAGTGCCAGCAGCCAGCCCAGGGCGGCGAGGGCACGCQCGGGCAGCGGCCGGCCGTGCGGAAGACGC 
ACCGCGCGCCGGCGCTCGAGGGCGATGAGCACCACGAGGTGGGCCGAGGCGGCCCGCCCGGATGCCTGCAG 
CAGCTGCAGGAAGCGGCACGCCAGGTCCCCCGTGGCCGCGCGGGGCTCGCCCAGCAGTTCCCAGGCCAGCT 
GTGACAGCGCCGTGCCCCCGCACGCGTACAGGTCCGCCAGGGCCAGCTGCACCAGCAGGAAGTCCATCTTG 
CGACGCTTNNNNNNNNNNNNNNNN^ 

CGCCACCACCAGGATGACCCCCAGGAACACCAGGCGGACGCG 

The following amino acid sequence <SEQ ID NO. 16> is the predicted amino 
acid sequence derived from the DNA sequence of SEQ ID NO. 15: 

RVRLVFLGVILWAVAGNTTVLCR LXXXXXXXXXXKRRKMD FLLVQLAI.ADLYACGGTAI.S QLAWELLGEPRA 
ATGDLACRFLQLLQASGRGASAHLWLIA1.ERRRAVRLPHG RPLPARALAALGWLLALLLA RGSGFWRYXXX 
XXXXXXXXTSLQPGAPLSARAWPGMRRCHWIFALLQRWHVQVYAFYE AVAGFVAPVKIMGVACGHLLS VWWRH 
RLKAPAGAAAWSASPGGARAPSAMPRAKVQSLKMSQLLGLLFVGCELPFADRLEAAWSSGPAGEWEGEALSAC 
CAWW 



The following DNA sequence beGPCR-seql5<SEQ ID NO. 17> was identified in 
H. sapiens: 

TCTAAGTTTTTCTCTGAACTTTGAGCCTGTGAAAAAAGAAGGGATGCTGCCTCAGGCCACCCCAGCCTAGA 
TACTCACTCTGAGTGCCATGAGGTAGTAGAGGACACTGATGACAGTCATGGGGAGGAGGTAGAATAGGAAG 
GAGGTGACCTGGATGATGAAATTGTAGATCCACATGGGCTTGATGACCGTACAGGTGGCCGAACCTGGGAC 
CAGGGACCCATTGGGGAAGTAGTGGAACTTGATGCCATGGATGCTGGTGTTGGGCAGGGAGAAGAGCACGG 
AGAAGCCCCAGACGATGCCGAGGATCCTGAGGGCCCGGCGCCGGGTGCTCTGCAGTTTGGCGCGGAACGGG 
TGTAGGATGGCCACGTAGCGCTCCACGCTGACGGTGGTGATGCTGAGGATGGAGGCGAAGCACACGGTCTC 
AAAGAGGGCCGTCTTGAAGTAGCAGCCCACGGGCCCGAACAAGAAAGGGTAGTTGCGCCACATCTCATAGA 
CCTCCAGGGGCATTCCAAGGAGCAGGACCAGGAGGTCAGAGACCGCCAGGCTGAAGAGGTAGTAGTTGGTG 
GGCGTCTTCATAGCCTGGTGCTGCAGAATCACCAGGCACACCAGGACATTGCCAATGACCCCCACCACAAA 
| AATTGGCACATACACCACAGACACGGGGAGGAAGAAGTGGCTGCGCCGAGGTGCGGAGAGGAAGGCCAGAT 
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ACTCCTCGGTGCTGTTCAGGTGTTTCTGGAATGGATCTTCTAGTTTCTGCTGGTAGATCCAGGAAGCATTC 
TGAAGTTTTTCCATCCCTGA 

The following amino acid sequence <SEQ ID NO. 18> is the predicted amino 
acid sequence derived from the DNA sequence of SEQ ID NO. 17: 

griMFifT.nwa.qWTYnOKLEDPFOKHLNSTEEYLAFLCGPRRSHFFL PVSVVYVPIFVVGVIGNVLVCLVIL QHQ 
AMKTPNT YYLFSLAVSDLLVIjLLGMPL EVYEMWRNYPFLFGPVGCYFK TAIjFETVCFAS ils ittvsv eryva 
ILHPFRAKLQSTRRR ALRILGIVMGFSVLFSLPNTSI HGIKFHYFPNGSLVPGSATCTVIKPMWIYNFIIQVT 
SFLFYLLPMTVISVLYYLMALRVSIAGVAG 

The following DNA sequence beGPCR-segl8 <SEQ ID NO. 19> was identified in 
H . sapiens: 

ATCAAGATGATTTTTGCTATCGTGCAAATTATTGGATTTTCCAACTCCATCTGTAATCCCATTGTCTATGC 

atttatgaatgaaaacttcaaaaaaaatgttttgtctgcagtttgttattgcatagtaaataaaaccttct 

CTCCAGCACAAAGGCATGGAAATTCAGGAATTACAATGATGCGGAAGAAAGCAAAGTTTTCCCTCAGAGAG 
AATCCAGTG 

The following amino acid sequence <SEQ ID NO. 20> is the predicted amino 
acid sequence derived from the DNA sequence of SEQ ID NO. 19: 

IKMIFAIVO IIGFSNSICNPIVY AFMNENFKKNVLSAVCYCIVNKTFSPAQRHGNSGITMMRKKAKFSLRENP 

The following DNA sequence beGPCR-seql6 <SEQ ID NO. 21> was identified in 
H. sapiens: 

GCCACAGCATGCAGTTTTCTGTAGAATTCCACTTTGTCTTTGCACTTGAAGAAGATGAGGTATCTGGTGAC 
CAGGATCACCACATAGAATAGGAACCGTGAGGTACATGTGGATGTGCAGCATGGCACTCACAAATTTGCAG 
AAGGGCAGCCCAAACATCCAAGTCTTCTTGATGAGGTAGGTCAAGCGAAATGGCACTGTCAGCAGAAAAAC 
GCTGTGGACCACCACCAAGTTAATGACCGCCATGGTGGTCACTGACCGGGTGTTCATTTTCACCAGGAGGA 
AAAGAATGGAAATGACACCCACCAGCCCGCCAATAAGCACTATGAAGTAGAGGCTGATTAAGTGGGGTGTC 
ACTATAGGATCGCAAGAGGAATTCCTGGAGGTATTGTGGCCAGGCATACTTGGGAAGTCACCTGGAGGAGA 

AAAAGCACCAGAGTAACTGAC 

The following amino acid sequence <SEQ ID NO. 22> is the predicted amino 
acid sequence derived from the DNA sequence of SEQ ID NO. 21: 

vqyqnarqppnDFPSMPGHNTSRNSSCDPIVTPHL ISLYFIVLIGGLVGVISILFL LVKMNTR SVTTKAVINL 
WVHSVFLLTVPFR LTYLIKKTW MFGLPFCKFVSAMLHIHMYL TVPILCGDPGHQIPHLLQVQRQSGILQKTA 

CCG 

The following DNA sequence beGPCR- seql7<SEQ ID NO. 23> was identified in 
H. sapiens: 

ACTGACCAAGGTCAGGGCATCGACTGAGGCTAGAAGGCCACAGGAAATGCCAGTCAAGGTGTTGGCGCCTG 
CAATCGCACCTACCACAAACTTGACCGGGGGCAGGGGGGCAGGCCCGCCAGCGAACACGGTCAGCAGCACC 
AGTCCATTGCAGAGCACGGAGAGCAACACGATGGCCCACACGGCCAGGCGGATGCCCCAGCTTTCAAAGAG 
GTACTCACA 

The following amino acid sequence <SEQ ID NO. 24> is the predicted amino 
acid sequence derived from the DNA sequence of SEQ ID NO. 23: 

CEYLFESWGI RLAWAIVLLSVXCNGLVLLTVFAG GPAPLPPVKFWGAIAGANTLTG ISCGLLASVDALTLV 
S 

The following DNA sequence beGPCR-seq20 <SEQ ID NO. 25> was identified in 
H. sapi ens : 

AACCCCATCATCTACACGCTCACCAACCGCGACCTGCGCCACGCGCTCCTGCGCCTGGTCTGCTGCGGACG 
CCACTCCTGCGGCAGAGACCCGAGTGGCTCCCAGCAGTCGGCGAGCGCGGCTGAGGCTTCCGGGGGCCTGC _ 
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GCCGCTGCCTGCCCCCGGGCCTTGATGGGAGCTTCAGCGGCTCGGAGCGCTCATCGCCCCAGCGCGACGGG 
CTGGACACCAGCGGCTCCACAGGCAGCCCCGGT 

The following amino acid sequence <SEQ ID NO. 26> is the predicted amino 
acid sequence derived from the DNA sequence of SEQ ID NO. 25: 

NPIIYTLTNRDLRHAIjLRLVCCGRHSCGRDPSGSQQSASAAEASGGLRRCLPPGLDGSFSGSERSSPORDGLD 
TSGSTGSPG 

The following DNA sequence beGPCR-seq21 <SEQ ID NO. 11 > was identified in 
H. sapiens: 

CGTGAAGAACAGCGCCACCATGACCAGCATGTGCACCACGCGCGCTCTGCGCCGCGATGCTCGCGGGTCCG 
CAGCCTCCTNNIWNNNNNNNNNNNNNN^ 

ACGATGAGCGCCAGCGGCGCCAGGTAGATGTGCGAGAAGAGCACAGTGGTGTAGACCCTGCGCATGCCCTT 
CTCGGGCCAGGCCTCCCAGCAGGAGTAGAGAGGGTAGGAGCGGTTGCGGGCGTCCACCATGAAGTGGTGCT 
CCTCACGGGTGACGGTCAGCGTGACGGCCGAGGGACACATGATGAGCAGCGCCAGGGCCCAGATGACGGCG 
ATGGTGACGAGCGCCTTCCGCAGGGTCAGCTTCTCGCGGAAAGGGTGCACGATGCAGCGGAACCT 

The following amino acid sequence <SEQ ID NO. 28> is the predicted amino 
acid sequence derived from the DNA sequence of SEQ ID NO. 27: 

FRCIVHPFREKLTLRP ^U^VTIAVIWALALLIMCPSAVT'L TVTREEHHFMVTJARNRSYPLYSCWEAWPEKGM 
RRVYT TVTjFSH I YLAPLAL. I WMYAR I ARKLCXXXXXXXXXXEAADPRAS RRRARWHMLVMVALFFT 



The following DNA sequence beGPCR - seq22 <SEQ ID NO. 29> was identified in 
H. sapiens: 

GCAGGGGGCGTGAGTCCTCAGGCACTTCTTGAGGTCCTTGTTGAGCAGGAAGCAGACAATTGGGTTGACGG 
CAGCCTGGGCGAAGCTCATCCAAACAGCATGGCCAGGTAGCGGTGGGGCACAGCACAGGCTTTCACAAACA 
CTCGCCAGTAGCAGGCCACGATGTAGGGTGACCAGAGGAGCAGAAAGAGCAGTGTGATCGCGTAGAACATG 
CGGCCCAGCTGCTTTTCACCCTTGACCTCGTCCATGCCCAGTAGCCGCCGGCTGGCTGCATGCCCATTCTG 
CCGGATACCCAGCAGGGTTGGTGGCATGGGCCC 

The following amino acid sequence <SEQ ID NO. 3 0 > is the predicted amino 
acid sequence derived from the DNA sequence of SEQ ID NO. 29: 

GPMPPTLLGI RQNGHAASRRLLGMDEVKGEKQLGRMFYAITLLFLLLWSPYIVACYWRVFVKACAVPHRYLAT 
AVWMSFAQAAVNPI VCFLLNKDLKKCLRTHAPC 



The following DNA sequence beGPCR-seq24 <SEQ ID NO. 31> was identified in 
H. sapiens: 

TATTCTGTAATGAAGAATGTCATTCACACTGCCATTGGCACATCCAGTGGCCTCACCTAGCATTGTGAAAG 
CCCTTCGGTTGGTGTATTGCCACTTCATTTTAAAAGGATGCACAAGTCCCTGGTGCCTTTCCACAGCAATG 
CAGGTCATAGTGAGGATTTCTGTCACAACAGCGGTAGACTGGACAAATGGCACCATCTTGCAAATGAAAGC 
ACCTGCAGTAAGGAAATAGGATAAATCATACATCAAAACAAAAAGAATAAAGGTTTCATCTGTGTCTTTGT 
AATTATCACTATCAGTCCATTCTGAGCCTCTGCCAAAAAGTTTGATAATTGTAATTACTCTGTAGACACA 

The following amino acid sequence <SEQ ID NO. 32> is the predicted amino 
acid sequence derived from the DNA sequence of SEQ ID NO . 31: 

VYRVITIIKLiFGRGSEWTDSDNYKDTDET FILFVLMYDLSYFLTAGAFICKMV PFVQSTAWTEILTMTClAV 
ERHQGLVHPFKMKWQYTNRRAFTMLGEATGCANGSVNDILHYRI 



The following DNA sequence beGPCR-seq27 <SEQ ID NO. 33> was identified in 
H. sapiens: 

GAGCAACATGATCTTTTTGAAGTACTTGACGGTGTCGTTCTTGACGGTCACGAAGCACAGAGTGTTGATCA 
TGCTGTTGCTCATGGCGATGCACTCGACGATGTAGAAGGCAGTGAGGTAGTGCTTCTCCTTCACAAACACG 
GTGGGGAAGAAGTCGCGCACGATGGTGAAGCCGTAGAAGGGCGCCCAGCATAGCACGTAGGCGGTGAGGAT 
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rrAr^TGAGCACCAGGACC ^TTCCTGCGGCAGCGCAGCCTCTTGCGGATCTGCTCTGTCTGGAATCCAG 
^^^^^Z^^^^^n anrn^aGrTCCCGGGAGATCCTGGCATAGCACAGGGTCATGGTGACCACGGGGCCCACG 



GGACCGCCTTGAACCAGAGCTCCCGGGAGATCCTGGCA' 
^GGATGGACAATAGCCAGATACCTAGGGCAAGAACACAAGTGGAGGCAGCC 

^Y F^^^^ 



sequence beG PCR-seq28 <SEQ ID MO. 35> was identitied in 



The following DNA 
H . sapiens: 



TTCCAGGGTGTTCAGCAGCAGGGACAGGGCCCAGGGCAGGGCACACATGACAGTTGACAGG 

cccgtttcctIccagcccgaccagggcaatgaaaaggatcaggaagaccgggatcag 

YGWCWTFDFITAVWL 



ThTT ollowing DNA s ilence beCPCR— q3KS,EQ 1U NO. i / > ^Isidentif ied in 

H. sapiens-. 

tctgacttacatcacatatgtaggcctgggcatttctatttgcagcctgatcctttgcttgt 

^^^^^^^^ 



GAGAGTCTGATTCT 



AGGCACTC 



CTTATCCTCTATGGAATCATGATTGTTTTC 



The following amino acid sequence <SEQ ID NO 38> is the predicted amino 
acid sequence derived from the DNA sequence of SEQ ID NO. 37. 
ESLILTYITYVGLGISICSLILCLSVEVLVWSQVTKTEITYLRHVCIVNIAATLLMADVWFIVASFLSGPITH 
HKGCVAATFFGHFFYLSVFFWMLAKAJjLI lygimi vf 



^T e following DNA sequ ence beGPCR-seql^SEQ ID NO. 39> ^aTTdentif ied in 



H . sapiens-. 



TTGTGTGGCAGTAGAGAGATGTCAGGCTTCAGAGTCA^CA^GA^CTGGATTTC^AACTG 



.CCTTTGGTAAGTGACTTATTATCTGCGAGCCTCTGTT 1 



GTGATCACATCTGGTTTGTGTTCCCAGGGGCACC 



CAGCACCGAGCGCTGCCTGTCTGTTCTGTGGCCCATCTGGTACC 

The fol lowing amino acid sequence <SEQ ID NO. 40> i J _thejredict £ d«ml^ 
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acid sequence derived from the DNA sequence of SEQ ID NO. 39: 

LCGSREMSGFRVNKNWISNWIGPPPLVSDLLSASLCFSLLMRTVNPIRQGGGENQRYSWSHLVCVPRGTRLGF 
LSMDPTVPVFGTKLTPINGREETPCYNQTLS F^ 7 LTCII5LVGLTGNWv~vLwLL GYRMRRJ\T AVSIYILNLAAA 
DFLFL5FQI I RSPLRLINISHLIRK IIjVSVMTFPYFTGLSMLSAIST ERCLSVLWPIWY 



The following DNA sequence beGPCR-seq33 <SEQ ID NO. 41> was identified in 
H. sapiens: 

ACAGAAAGCAAGGCCACCAGGACCTTAGGCATAGTCATGGGAGTGTTTGTGTTGTGCTGGCTGCCCTTCTTTG 
TCTTGACGATCACAGATCCTTTCATTAATTTTACAACCCTTGAAGATCTGTACAATGTCTTCCTCTGGCTAGG 
CTATTTCAACTCTGCTTTCAATCCCATTTTATATGGCATGCTTTATCCTTGGTTTCGCAAGGCATTGAGGATG 
ATTGTCACAGGCATGATCTTCCACCCTGACTCTTCCACCCTAAGCCTGTTTTCTGCCCATGCTTAGGCTGTGT 
TCATCATTCAATAGGACTCTTTTCTGG 

The following amino acid sequence <SEQ ID NO. 42> is the predicted amino 
acid sequence derived from the DNA sequence of SEQ ID NO . 41: 

TESKATR TLGIVT^GVFVLCWLPFFVXTITDPF INFTTLEDLYTJ VFLWLGYFNSAFNPILYGML YPWFRKAIjRM 
IVTGMIFHPDSSTLSLFSAHAAVFI IQDSF 



The following DNA sequence beGPCR-seq34 <SEQ ID NO. 43> was identified in 
H. sapiens-. 

TAGGAATCTCAGAGAAGAAAGTAAGGAACCAGAAAACCATAAAAGAATGTAAATGGAAAAGAATCAGCAAATC 
TTATTCACTTATCACTAAATCTAAAATATGTCAAAATACATGAAGACAACAAATGCTTTAGAACAACTGTTGA 
ATGTATTGTCCTACAACTTGGCATATGATCATGCTTGCCTCTCTATGTCCAAGTGTTTATTTTTGCAGTTGAC 
CTTAATTTCAAGTTAGTTTTGAGGTCTCTACAGTAATGTTTTTAATCTGTCTCTACTTCTTCAGAAAATAAAT 
TAGTTGTTGACGAATCAGTCCTTAAGACCTTGCCGCTTACAATAAGTTTTATTGCCTTCCCAAACCATTGGTA 
AAAGAAAGCATAAATCAAGGGGTTCATAGCTGAATTATAATAAACACACCAAACTAAAATCTCATAAACATAA 
GGAGGAGTTATAAAATTCATATAAGCATCAATCACTGCATCAACGAGGTATGGTAGCCAAGAGACAAGAAATG 
CTGC 

The following amino acid sequence <SEQ ID NO. 44 > is the predicted amino 
acid sequence derived from the DNA sequence of SEQ ID NO. 43: 

LHQRG^4VAKRQE MLAAFLVSWLPYLVTJAVIDA YMNFITPPYVYE ILVWCVYYNSA^1NPLIYAFF YQWFGKAIK 
LIVSGKVLRTDSSTTNLFSEEVETDKJ^YCRDLKTNLKXRSTAKINTWTRGKHDHMPSCRTIHSTVVLKHLLSS 
CI 



The following DNA sequence beGPCR-seq35 <SEQ ID NO. 45> was identified in 
H. sapiens: 

CTGGAAAGAGGTCCTCGATCTATCCTCTACGCCGTCCTTGGTTTTGGGGCTGTGCTGGCAGCGTTTGGAAACT 
TACTGGTCATGATTGCTATCCTTCACTTCTAACAACTGCACACACCTACAAACTTTCTGATTGCGTCGCTGGC 
CTGTGCTGACTTCTTGGTGGGAGTCACTGTGATGCCCTTCAGCACAGTGAGGTCTGTGGAGAGCTGTTGGTAC 
TTTGGGGACAGTTACTGTAAATTCCATACATGTTTTGACACATCTTTCTGTTTTGCTTCTTTATTTCATTTAT 
GCTGTATCTCTGTTGATAGATACATTGCTGTTACTGATCCTCTGACCTATCCAACCAAGTTTACTGTGTCAGT 
TTCAGGGATATGCATTGTTCTTTCCTGGTTCTTTTCTGTCACATACAGCTTTTCGATCTTTTACACGGGAGCC 
AACGAAGAAGGAATTGAGGAATTAGTAGTTGCTCTAACCTGTGTAGGAGGCTGCCAGGCTCCACTGAATCAAA 
ACTGGGTCCTACTTTGTTTTCTTCTATTCTTTATACCCAATGTCGCCATGGTGTTTATATACAGTAAGATATT 
TTTGGTGGCCAAGCATCAGGCTAGGAAGATAGAAAGTACAGCCAGCCAAGCTCAGTCCTTCTCAGAGAGTTAC 
AAGGAAAGAGTAGCAAAAAGAGAGAGAAAGGCTGCCAAAACCTTGGGAATTGCTATGGCAGCATTTCTT 

The following amino acid sequence <SEQ ID NO. 46> is the predicted amino 
acid sequence derived from the DNA sequence of SEQ ID NO. 45: 

LERGPRS IL YAVLGFGAVLAAFGNLLVMIAI LHF QLHTPTNF L I ASL ACA DFL VGVTVMPF STVRSVESCWYF 
GPS YCKFH TCFDTSFCFASLFHLCCI SV DRYI AVTDPLTYPTKFTVSVS GICIVLSWFFSVTYS FS I F YTGAN 
EEGI EELWA-LTCVGGCQAPLNQNW VLLCFLL.FFI PNVAMVFI YS KI FLV AKHQARKIESTASQAQSFSESYK 
E R V AKRER KAAKT LG I AMAA F L 
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The following DNA sequence beGPCR-seq36 <SEQ ID NO. 47> was identified in 

H. sapiens: 

AACCAGGTGGCCTTACTCCTAAGACCCCTGGCCTTGTCTATGGCCTTTATCAACAGCTGTCTCAATCCAGTTC 
TCTATGTCTTCATTGGGCATGACTTCTGGGAGCACTTGCTCCACTCCCTGCTAGCTGCCTTAGAACGGGCACT 
TAGCGAGGAGCCAGATAGTGCCTGAATCCCAGCTCCCAGGCAGATGAGTCCTTTATAACATGACCCAATTTCC 
TACTCCATTTTCCCACCACTCAATCCTCTTCCCAAACAGCTCTACCATAATCCAACATCCAACAGAATTTAAG 
AGAATAAACCACAACTTTTAAGTGAGCTCTATGTGCTAGGTCATGTTTTAGAATACAACCTTAAGTGCCTGGA 
AGATGGAGGCAAGAAACAAACAAGGTCTCATTCTTTAGAGGAAGACAGTTCACCAAGACTCAAACAGAAAAAA 
AGATAGTTATCTTGTGACAAAACAAGTCATAAAATTGGGTCAGGACCTGCAGCAATGACTTTATGCTAGAATC 
CAGAGCACTAGCAGGAAACTGCTTAAATTTTACTTAATCAAAGTCAAGTTTGGACATACATGTCAGGTAAAAC 
CTAGCAGAGATGAGCTACCTTGATTTTAAAACTTCAAGGGATAGCTCAATGTCATCAAGATCCTTTTGATGAC 

TTG 

The following amino acid sequence <SEQ ID NO. 48> is the predicted amino 
acid sequence derived from the DNA sequence of SEQ ID NO. 47: 

MHVftTJJ.RPIJU.SMAFI NSCLNPVLWFI GHDFWEHLLHSLIAALERALSEEPDSAIPAPRQMSPLHDPISYS 
IFPPLNPLPKQLYHNPTSNRIENKPQLLSELYVLGHVLEYNLKCLEDGGKKQTRSHSLEEDSSPRLKQKKRLS 
CDKTSHKIGSGPAAMTLCNPEHQETAILLNQSQVWTYMSGKTQRATLILKLQGIAQCHQDPFDDL 



The followi ng DNA sequence beGPCR- seq37 <SEQ ID NO. 49> was identified in 
H. sapiens: 

rCTTGTTCACGGCCACCATCCTCAAGCTGTTGCGCACGGAGGAGGCGCACGGCCGGGAGCAGCGGAGGCGCGC 
GGTGGGCCTGGCCGCGGTGGTCTTGCTGGCCTTTGTCACCTGCTTCGCCCCCAACAACTTCGTGCTCCTGGCG 
CACATCGTGAGCCGCCTGTTCTACGGCAAGAGCTACTACCACGTGTACAAGCTCACGCTGTGTCTCAGCTGCC 
TCAACAACTGTCTGGACCCGTTTGTTTATTACTTTGCGTCCCGGGAATTCCAGCTGCGCCTGCGGGAATATTT 
GGGCTGCCGCCGGGTGCCCAGAGACACCCTGGACACGCGCCGCGAGAGCCTCTTCTCCGCCAGGACCACGTCC 
GTGCGCTCCGAGGCCGGTGCGCACCCTGAAGGGATGGAGGGAGCCACCAGGCCCGGCCTCCAGAGGCAGGAGA 

GTGTGTTCTGAGTCCCGGGGGCGCAGC 

The following amino acid sequence <SEQ ID NO . 50> is the predicted ammo 
acid sequence derived from the DNA sequence of SEQ ID NO. 49: 

Ti|T , T TivT,. P TPP &W P,PRnRRRAV GLAAWLUmCFAPNNFVLLAHI VSRLFYGKSYYHVYKLTLCLSCL 
NNCLDPFVYYFASREFQLRLREYLGCRRVPRDTLDTRRESLFSARTTSVRSEAGAHPEGMEGATRPGLQRQES 

VFVPGAQAAPPGLR 



The followi ng DNA sequence beGPCR-seq38 <SEQ ID NO. 51> was identified in 
H. sapiens: 

TTACTTATTCTGCCCTTTATCCAACTTTTAATTCCCTTTGCTATTCTCCTGCCTCATTTTCTGGCCTCATTTT 
CCCTATTATCCTGCCTCACATTGATCAAGGGATGAGGCTGGCAGGATCCGGAACCCACAGGGCCCCGTGGGCC 
ATGAGAGGCTCCTGGACTTGAACCTCAGGACACTCCCACTCTGGCTGCCGGCAGGGATGGAAGCTGGATGAGC 
AGGCAGGAGCTGGCAGTGGGGGTGGAGAGCCATAGGCTATTGGGGTGGACAGGCTTGGGTGCCTCATGGGAGC 
TCCCCATGGGAGCTGTGGCCCCTTGGGGCCTCTTATTTCTCACCCCAGGCTTTCCCGGGAGAGGTTCAAGTCA 
GAAGATGCCCCAAAGATCCACGTGGCCCTGGGTGGGAGCCTGTTCCTCCTGAATCTGGCCTTCTTGGTCAATG 
TGGGGAGTGGCTCAAAGGGGTCTGATGCTGCCTGCTGGGCCCGGGGGGCTGTCTTCCACTACTTCCTGCTCTG 
TGCCTTCACCTGGATGGGCCTTGAAGCCTTCCACCTCTACCTGCTCGCTGTCAGGGTCTTCAACACCTACTTC 

GGGCACTACTTCCTGAAGC 

The following amino acid sequence <SEQ ID NO. 52> is the predicted amino 
acid sequence derived from the DNA sequence of SEQ ID NO. 51: 

ETYSALYPTFNSLCYSPASFSGLIFPIILPHIDQGMRLAGSGTHRAPWAMRGSWTTSGHSHSGCRQGWKLDEQ 
AGAGSGGGEPAiGVPRLGCLMGAPHGSCGPLGPLISHPRLSRERFKSEDAPKI HVALGGSLFLLNLAFLVNVG 
sr,SKGSDAACWARGA VFHYFLLCAFTWMGLEAFHLYLL AVRVFNTYFGHYFL 



The follow i ng DNA sequence beGPCR-3eq40 <SEQ ID NO. 53> w as i dentified in 
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H. sapiens: 

AATTGGTCGGAGAGTGCAGCTGCTTGAAATGGAGGATTGAAATCATCACCAGGAGGTTTCCAAACACAGCCAG 
CACAGCCCCAAAGCCAAACACTATGTACAGAATCACCCGGGATCCCGGCGAGAAGGGGATTTTCACACAGGAC 
CCATTCACGTTCGCGTAGCACAGCTGCACAGCCACCAGCAGGGATGAATTGCTGCTCATAACGCTGGTATTTA 
CATATGGAGAAATTTTGTCCTTGTTGATTATCACAAAAAATACAGGATTGTTCCTGATTTTCATTGCTCCTGC 
GGAAAAAAACACATATTCACCAGGATGCCAGAGGAAATGATCA 

The following amino acid sequence <SEQ ID NO. 54> is the predicted amino 
acid sequence derived from the DNA sequence of SEQ ID NO. 53: 

DHFLWHPGEYVFFSAGAKKIRNNPVFFVIINKDKISPYVNTSVMSSNSSLLVAVQLCYANVNGSCVKIPFSPG 
SR VILYIVFGFGAVLAVFGNLLVMIS I LHFKQLHSPTN 

The following DNA sequence beGPCR-seq41 <SEQ ID NO. 55> was identified in 
H. sapiens : 

CACATCTTAACAAGACTGAAAAACATTGATTTGTTTTTAATTTGAAGAGCAATTTATTTGCTATTCATTCATA 
GTCTTACTTGATTTTTAAAAACTCATTTCGCTTGGTAATTTTAAAGGTATCCTGAACTTCGTCTATCCAACTG 
CTTATATATGTTCAGAAAACAAATTCATGGTTGCTGAACTGTTCTTTAAAACCTGACOAGTTACAATAACTTT 
TATTGCTTTCCTAAACCATGGGTAAAATAAAGCATAAATCAAAGGATTCATGGCTGAGTTATAATAAGCACAC 
CAACAGCATCATAAATACAGGCAGGGGTTATAAAGCCCATAAAGGCATCAATTAATGAATCAATGCTATATGG 
TAACCATGAAATCATAAATGCTACCACTGTGACCCCCAGGGTTTTAGCTGCTTTTCTCTCTCTCCTGGCCACT 
CTGGCTTTGTAACTCTCTGAGGATGATTCTGTCTTGCTACCAGTATTTTCTATCTTTTTCGCCTGTCGTCTAG 
CCACAAGAAATATGTTACCATACAGAATTATCATAATAAAGGTAGGTATAAAGAAGGATAGAAAATCTGTCAA 
CA 

The following amino acid sequence <SEQ ID NO. 56> is the predicted amino 
acid sequence derived from the DNA sequence of SEQ ID NO. 55: 

LTDFLSFFIPTFIMI ILYGN I FLVAJ^RQAKKIENTGSKTESSSES YKARVAP^RERKA AKTLGVTVVAFMI SWL 
PYSIDSLID AFMGFITPACIYEICCWCAY YNSAMNPLIYALFYPWFRKAIKVIVTGOVLKNSSATMNLFSKHT 
AVGTKFRIPLKLPSEMSFKSSKTMNEQINCSSNKQINVFQSCDV 

The following DNA sequence nGPCR-seq53 <SEQ ID NO. 57> was identified in 
H. sapiens: 

TTTGTGGCAAGGAGACCCTGATCCCGGTCTTCCTGATCCTTTTCATTGCCCTGGTCGGGCTGGTAGGAAACGG 
GTTTGTGCTCTGGCTCCTGGGCTTCCGCATGCGCAGGAACGCCTTCTCTGTCTACGTCCTCAGCCTGGCCGGG 
GCCGACTTCCTCTTCCTCTGCTTCCAGATTATAAATTGCCTGGTGTACCTCAGTAACTTCTTCTGTTCCATCT 
CCATCAATTTCCCTAGCTTCTTCACCACTGTGATGACCTGTGCCTACCTTGCAGGCCTGAGCATGCTGAGCAC 
CGTCAGCACCGAGCGCTGCCTGTCCGTCCTGTGGCCCATCTGGTATCGCTGCCGCCGCCCCAGACACCTGTCA 
GCGGTCGTGTGTGTCCTGCTCTGGGCCCTGTCCCTACTGCTGAGCATCTTGGAAGGGAAGTTCTGTGGCTTCT 
TATTTAGTGATGGTGACTCTGGTTGGTGTCAGACATTTGATTTCATCACTGCAGCGTGGCTGATTTTTTTATT 
CATGGTTCTCTGTGGGTCCAGTCTGGCCCTGCTGGTCAGGATCCTCTGTGGCTCCAGGGGTCTGCCACTGACC 
AGGCTGTACCTGACCATCCTGCTCACAGTGCTGGTGTCCCTCCTCTGCGGCCTGCCCTTTGGCATTCAGTGGT 
TCCTAATATTATGGATCTGGAAGGATTCTGATGTCTTATTTTGTCATATTCATCCAGTTTCAGTTGTCCTGTC 
ATCTCTTAACAGCAGTGCCAACCCCATCATTTACTTCTTCGTGGGCTCTTTTAGGAAGCAGTGGCGGSTGCAG 
CACCCGATCCTCAAGCTGGCTCTCCAGAGGGCTCTGCAGGACATTGCTGAGGTGGATCACAGTGAAGGATGCT 
TCCGTCAGGGCACCCGGAGATTCAAAGAAGCATTCTGGTGTAGGGATGGACCCCTCTACTTCCATCATATATA 
TGTGGCTTTGAGAGGCAACTTTGCCCC 

The following amino acid sequence <SEQ ID NO. 58> is the predicted amino 
acid sequence derived from the DNA sequence of SEQ ID NO. 57: 

CGKETLI PVFLILFIALVGLVGNGFVLWLLGFRMRRNAFSVYVLSLAGADFLFLCFQI INCLVYLSNFFCSXS 
INF PSFFTTV>lTCAYLAGLSML Sjr^STERCL 

FSDGDSGWCQTFD FITAAWLIFLFMVLCGSSLALL VRILCGSRGLPLTRL YLTILLTVLVSLLCGLPFGIQW F 
LILWIWKTJSDVLFCHIHPVS WLS SLNSSANP 1 1 YFFVG S FRKQWRXQHP I LKLAL.QRALQDI AEVDHS EGCF 
RQGTRRFKEAFWCRDGPLYFHH I YVALRGNFA 
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The following DNA sequence nGPCR- seq54<SEQ ID NO. 59> was identified in 
H. sapiens: 

CTTTGCATCTCACTGTTGAGCAGACAGCCTGCTGAAAGTTGTCGCTGACCACCACATATAGTAACAGGTTACC 
AAAGGTGTTCAGAGCAGCATAATGGTCTAGAAACGATGTAAGCTTCATGGATCTGATTCTCAATGGAACAACT 
GATTGAAAGCAGGCTGAGATTCGATCCTGAATGACCCTCAAGATATGGAAGGGTAAAAAACATACGTAAAATG 
CAAGGAGTAGCAGAATGGTTAGCCTTCGTGCTTTCTGCTTAAGGCAGCTGTCAGTTTGCAGTCCATGGGTCAA 
AGTGTGGATAATCGTGGTATAGCAAAGTGTCACTATCACCAAGGGGAGGCAGAAAGTACTTGCAGTCAAAATC 
AGGTTGTACCACTTAATAGTATTGAGTTCATCCGAACTGGTGAGGTCGAGACAGGCTGATCTGTTGGTCCTGT 
TGGTTGATGTGATCAAGAAGGTCATCGGAATGACAGCTACCAGTGAAATGATCCACACCACAGCACAGGCTAC 
AACTGCACATCGAGTTTTGTGAATGGAAAAGCAGCTCATTGGGTGAATGATCACACAGTAGCGGAAG 

The following amino acid sequence <SEQ ID NO. 60> is the predicted amino 
acid sequence derived from the DNA sequence of SEQ ID NO. 59: 

FRYCVIIHPMSCFSIHKTRCAWACAVWIISLVAVIPMTFLITSTNRTNRSACLDLTSSDELNTIKWYNLIL 
TASTFCLPLVI VTLCYTTI IHTLTHGLQTDSCLKQKARRLTILLLLAFYVCFLPFHILRVIQDR I SACFQSW 



PLRIRSMKLTSFLDHYAALNTFGNLLLYWVSDNFQQAVCSTVRCK 



The following DNA sequence nGPCR-seq55 <SEQ ID NO, 61> was identified in 
H. sapiens, where the underlined ATG identifies a probable start codon : 



GGGAGGGCTCGTAGACACACTAACCCTACCCTTTCTGTTTCTTCCTCATCTTTCCTTTCCATCTGTTTCTCAT 
GGTCTCCTGTCTGTCTCTCTCTCTCTCCCCTCTTTCTCTCTCCTCGCTCTTTCTCATCCCCTCCATTTCTGTG 
TCAATCTCAATCCATTTATATCGGTGGCCACTTTTCTATCTCTTTGTTCTATCTCTCTCTCTCTCTCTTTCCC 
ACTTTGTCTCTGCACGCCTGTTGTGTTTTTCTGCCTGTCTCTCTCTTGCCCTCATCTCTCTGTCTCTCTCTTG 
CCCTCATCTCTCTGTCTCTCTGTGTCTGTGTCTCCCCCGCTCATTCCCATTTGCAGGTGCAATGTAGCAGGAC 
AACTCATGGAGCCCCCCCGGGCCCATCGAGTACCGGACTGGCTGACCCCCTAGGGTTGGCAGTAGCCCCTGAC 
CCTCAGTATGGCCAACACTACCGGAGAGCCTGAGGAGGTGAGCGGCGCTCTGTCCCCACCGTCCGCATCAGCT 
TATGTGAAGCTGGTACTGCTGGGACTGATTATGTGCGTGAGCCTGGCGGGTAACGCCATCTTGTCCCTGCTGG 
TGCTCAAGGAGCGGGCCCTGCACAAGGCTCCTTACTACTTCCTGCTGGACCTGTGCCTGGCCGATGGCATACG 
CTCTGCCGTCTGCTTCCCCTTTGTGCTGGCTTCTGTGCGCCACGGCTCTTCATGGACCTTCAGTGCACTCAGC 
TGCAAGATTGTGGCCTTTATGGCCGTGCTCTTTTGCTTCCATGCGGCCTTCATGCTGTTCTGCATCAGCGTCA 
CCCGCTACATGGCCATCGCCCACCACCGCTTCTACGC.CAAGCGCATGACACTCTGGACATGCGCGGCTG 

The following amino acid sequence <SEQ ID NO. 62> is the predicted amino 
acid sequence derived from the DNA sequence of SEQ ID NO. 61: 

MANTTGEPEEVSGAXjSPPSASAYVKLVLL^ 

vrFPFVT.ASVRHGSSWTFSA LSCKIVAFMAVLFCFHAAFMLFCISV TRYMAIAHHRFYAKRMTLWTCAAE 



The following DNA sequence nGPCR-seq56 <SEQ ID NO. 63> was identified in 
H. sapiens-. 

AAAAATTGCTGTACTGAACTATTGAATGGAACTTGGAAATAAAGTCCCTTCCAAAATAACTATTCTTCAACAG 
AGAGTAATAGGTAAATGTTTTAGAAGTGAGAGGACTCAAATTGCCAATGATTTACTCTTTTATTTTTCCTCCT 
AGGTTTCTGGGATAAGTATGTGCAAATAAAAAATAAACATGAGAAGGAACTGTAACCTGATTATGGATTTGGG 

aaaaagataaatcaacacacaaagggaaaagtaaactgattgacagccctcaggaatgatgcccttttgccac 
aatataattaatatttcctgtgtgaaaaacaactggtcaaatgatgtccgtgcttccctgtacagtttaatgg 
tgctcataattctgaccacactcgttggcaatctgatagttattgtttctatatcacacttcaaacaacttca 
taccccaacaaattggctcattcattccatggccactgtggactttcttctggggtgtctggtcatgccttac 
agtatggtgagatctgctgagcactgttggtattttggagaagtcttctgtaaaattcacacaagcaccgaca 
ttatgctgagctcagcctccattttccatttgtctttcatctccattgaccgctactatgctgtgtgtgatcc 
actgagatataaagccaagatgaatatcttggttatttgtgtgatgatcttcattagttggagtgtccctgct 
gtttttgcatttggaatgatctttctggagctaaacttcaaaggcgctgaagagatatattacaaacatgttc 
actgcagaggaggttgctctgtcttctttagcaaaatatctggggtactgacctttatgacttctttttatat 
acctggatctattatgttatgtgtctattacagaatatatcttatcgctaaagaacaggcaagattaattagt 

G ATG C C AAT C AG A 

The following amino acid sequence <SEQ ID NO. 64> is the predicted amino__ 
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acid sequence derived from the DNA sequence of SEQ ID NO. 63: 

rektdqpsgmmpfchni iniscvknnwsndvrasly slmvli ilttlvgnlivivsishf kqlhtptn wlihs 
i^tvdfll gciivmpy smvrsaehcwyfgevfc kihtstdimlssasifhlsfisi dryyavcdplrykakmni 
lvicvmif'iswsvpavfafgmifl elnfkgaeeiyykhvhcrgg csvffskisgvltfmtsfyipgsiml cvy 

YRIYLIAKEQARLISDANQ 

The following DNA sequence nGPCR- seq57<SEQ ID NO. 65> was identified in 
H . sapiens- 

AACAGTCCCGGGTGGAACCTGGGCATGTATATTTTGATTGTTTTATGCATACTCCTAGTGAAGAACCAATGTC 
TTGCTCAGATAGAAGCAAGATACTCAGACTTAGTTTCTCTGTAGCTCCTGCTTTTTATTATTCCTGGTTGGAT 
TGCACCACTACTCAGTTTCTATTTTATAATACTGATTATAAAACATGGGAGGGAAATAACTTTGTATTGGTTT 
TTATGGATAATTTATTATGTGTCCTAGACTCTGGCCTTGTCAAAAGAAGGACGTAAGAAGGCACGATGTATTA 
TACTTGGGAATGATAGAAGAGACTGACCTGGTATTTCCACCCGGAAGAGGGAAAGGATTTTAACTACAAATAC 
AGGAATCCAGCAGATGGCATCAGAGAACACTATAAAAAAGAAACGATTTGCAACAGCCACCTCTCTTCCAAAA 
CAATTCCTTACTTCTGTGGTCTGCAAGGCGGTTTTTTGAATGGAACAGAACATAGTAATATAGGAAAACACAA 
TGATGAGAAAAGCCAGCAAGTTCACACCTGTTGGGGAAAAGCACACTTTTAACATCTCAGGCGTAAAAGTCAA 
CAGTAAAATTACTGTGGTACAGGTTGAGTATCCCTTACCCAAAATGTTTGAAACCAGAAATGTTTTGGATTTC 
GGATTTCGGAATATTTACACATTCATAATGATATATCTTGGAAATGGTTCCCAAGTCTAAACACAAAATTTAT 
TTATGTTTCATATACACCTTATACACATAGTCTGAAAGTAATTTTGTACAATATTTTAAATAATTTTGGGCAT 
GAAACAAAGTTTGCATACATTGAACCATCAGACAGCAAAAGCTTCAGGTGTGGAATTTTCCACTTGTGGCATC 
ATGTTGATGCTCAAAAAGTTCCATATTTTAGAGCATTTCAAATTTTGGATTTTCAAATTACAAATGCTTAACC 
TGTACTTAGATGTTAAATACAGTGCCTCTTCCACGGGCACTTTCAGGAAGCATTCTTTTATATAAGCCC 

The following amino acid sequence <SEQ ID NO. 66> is the predicted amino 
acid sequence derived from the DNA sequence of SEQ ID NO. 65: 

YIKECFLKVPVEEALYLTSKYRLSICNLKIQNLKCSKIWNFLSINMMPQVENSTPEAFAVWFNVCKLCFMPKI 
INIVQNYFQTMCIRCININKFCVTWEPFPRYI IMNVIFRNPKSKTFLVSNILGKGYSTCTTVILLLTFTPEML 
KTVCFS PTGVNLLAFLI I VFSYITMFCS IQ KTALQTTEVRNCFGREVAVAN RFFFI VFSDAICWIPVFVVKILS 
LFRVEI PGQSLLSFPS I IHRAFLRPSFDKARVDTI IHKNQYKVISLPCFI ISI IKKLSSGAIQPGI IKSRSYR 
ETKSEYLAS I ARHWFFTRSMHKTIKI YMPRFHPGL 



The following DNA sequence nGPCR-seq58 <SEQ ID NO. 67> was identified in 
H. sapiens i 

ACTACCATGGAAGGTGACCTGGGTGCCACTGGCCACAGGCCCCGCACAGAGCTTGATGATGAGGACTCCTACC 
CCCAAGGTGGCTGGGACACGGTCTTCCTGGTGGCCCTGCTGCTCCTTGGGCTGCCAGCCAATGGGTTGATGGC 
GTGGCTGGCCGGCTCCCAGGCCCGGCATGGAGCTGGCACGCGTCTGGCGCTGCTCCTGCTCAGCCTGGCCCTC 
TCTGACTTCTTGTTCCTGGCAGCAGCGGCCTTCCAGATCCTAGAGATCCGGCATGGGGGACACTGGCCGCTGG 
GGACAGCTGCCTGCCGCTTCTACTACTTCCTATGGGGCGTGTCCTACTCCTCCGGCCTCTTCCTGCTGGCCGC 
CCTCAGCCTCGACCGCTGCCTGCTGGCGCTGTGCCCACACTGGTACCCTGGGCACCGCCCAGTCCGCCTGCCC 
CTCTGGGTCTGCGCCGGTGTCTGGGTGCTGGCCACACTCTTCAGCGTGCCCTGGCTGGTCTTCCCCGAGGCTG 
CCGTCTGGTGGTACGACCTGGTCATCTGCCTGGACTTCTGGGACAGCGAGGAGCTGTCGCTGAGGATGCTGGA 
GGTCCTGGGGGGCTTCCTGCCTTTCCTCCTGCTGCTCGTCTGCCACGTGCTCACCCAGGCCACAGCCTGTCGC 
ACCTGCCACCGCCAACAGCAGCCCGCAGCCTGCCGGGGCTTCGCCCGTGTGGCCAGGACCATTCTGTCAGCCT 
ATGTGGTCCTGAGGCTGCCCTACCAGCTGGCCCAGCTGCTCTACCTGGCCTTCCTGTGGGACGTCTAGTCTGG 
CTACCTGCTCTGGGAGGCCCTGGTCTACTCCGACTACCTGATCCTACTCAACAGCTGCCTCAGCCCCTTCCTC 
TGCCTCATGGCCAGTGCCGACCTCCGGACCCTGCTGCGCTCCGTGCTCTCGTCCTTCGCGGCAGCTCTCTGCG 
AGGAGCGGCCGGGCAGCTTCACGCCCACTGAGCCACAGACCCAGCTAGATTCTGAGGGTCCAACTCTGCCAGA 
GCCGATGGCAGAGGCCCAGTCACAGATGGATCCTGTGGCCCAGCCTCAGGTGAACCCCACACTCCAGCCACGA 
TCGGATCCCACAGCTCAGCCACAGCTGAACCCTACGGCCCAGCCACAGTCGGATCCCACAGCCCAGCCACAGC 
TGAACCTCATGGCCCAGCCACAGTCAGATTCTGTGGCCCAGCCACAGGCAGACACTAACGTCCAGACCCCTGC 
ACCTGCTGCC 

The following amino acid sequence <SEQ ID NO. 68> is the predicted amino 
acid sequence derived from the DNA sequence of SEQ ID NO. 67: 

TTMEADLGATGHRPRTELDDEDS YPQGGWDT VFLVA ItLLL GLPANGLMAWL AGSQARHGAGTR LALLL 
LSLALSDFLiFLAAAAFQILEIRHGGHWPLiGTAACRFYYFLjWGVS YSSGLiFLLAALSLDRCLLALCPHW 
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Y p^HP™""-PT.WV CAG\^LATLFSVPWLVFP EAA^^ 

FAAALCEERPGSFTPTEPQTQLDSEGPTLP 



EELSLRMLEVLiGGFLPFL 



LLLVCHVLTQA TACRTCHRQQQPAACRGFARVARTI 
FALVYSD YLILLNSCLSPFLCLMASA DLRTLLRSVLSS 

EP1 



^AEA^SQMDPVAQPQVNPTLQPRSDPTAQPQLNPTAQPQSDPTAQPOLNLMAQPQSDSVAQPQADT 



NVQTPAPAA 



The following DMA se quence nGPCR- 3e q59 <SEQ ID HO. 69> was identified in 
H. sapiens: 

TACAGGCCTGAGCATGCTGGGCTCCATCAGCACCAAGCACTGCCTGTCCATCCTGTGGCCCATCTAGTACCGC 
TGCCACCACCCCACACACCTGTCAGCAGTCGTGTGTCCTGCTCTGGGCCC 

GA^TGGATGTTCTGTGGCTTCCTGTCTAGTGGTGCTGATTCTGTTTGGTGTGAAACATCAGATTTCATCACAG 



icACATGGCTGATTTTTTTATGTGTGGTTCTCTGCGGGTCCAGCCCGGTTCTGCTGGTCAGGATCCTTTGTGG 

ATCCC 

tt^gtcc^^ 



ATCCCGGAAGATGCCCTTGACCAGGCTGTACATGACCATCCTGCTCAGAGTGCTGGTCTTCCTCCTCTGTGAC 

cc^ggcattcLtgattcctatttttctggatccacgtggatttgtcacgttcgtctagtttccat 



AG 



The following a.inc acid sequence <SEQ ID NO. 70> i. . the predicted amino 
acid sequence derived from the DNA sequence of SEQ ID NO. 69. 
VRPEHAGLHOHQALPVHPVAHLVPLPPPHTPVSSRVSCSGPCPCCRA 5WNGCSVASCLVVLILFG VKHQISSQ 
SHGl^FSAGtAR^WSGSFVDPGRCPPGCTPSCSECW^ 

TLNSSAI^II VFFMGSFRQLQNRKTLLVLQRAIjQDTPEVEEGRWRLSEETLELSSRLGPGRASALSV 



The following DNA se quence nGPCR-seqSO <SEQ ID NO. 71> was identified in 

H. sapiens: 

^CGAAGGCAGGCCGCAGAAGAGAAGAGGAGGACGGTGAGGAGGATGAGCCCAGGGAAGCCCCGGGGTGGG 

ITAGCA' 

'"'GGCAC 

3CCCAGGTGAGGGCGCACACACAGGTGGTCAGGTGG 



GGCCGCTGGGGGCCTCGCTCCACCCGCAGCAGCAGCATAAGGCTGG 

CCAGCAGCACCGCTGCCACCAGCCACAGCGTCCGGCACAAGTGGCGGCTGGGCTCCCCGAAGAA 

ggcgccgctgagcIgcaggtgcagcagcaggcagagggcccaggtgagggcgcacacacaggtggtcaggtc 

CGTGGGCGGCGGCACGAGTACCAGGCTGGGAAGAGGGCGGCCAGGCACTGCTCCACGCTGACGGCCGCCAGGA 



gactcaggcccacgatgtagcacaagaagc^^ 

CCGGCCTTGCAGCAAGTCGGGGACGATGGCCACCATGTGGCAGCCAAGGAAGATGAGATCCGCGCAGGCCACG 
TCCAGGAGGTAGATGGCGAAAGGGTTTCTGTAGACATTGGAGCTGAGC 

The following amino acid sequence <SEQ ID NO 72> is the predicted amino 
acid sequence derived from the DNA sequence of SEQ ID NO. 71. 
TSSrm'RNPFAIYLLDVACADLIFLGCHMVAIVPDLLQGRLDFPGFVQTSLATLRFFCYIVGLSI^A 



LSGACT QFFGEPSRHLC RTLWLVAAVL 
LFSSAACLRH 



I^Llcctmcgaslmlllrvergpqrppprgfpglilltvl 



dentified in H . 



ThTT^llowing DNA sequence nGPCR-1 <SEQ ID NO. 73, was i 
sapiens : 

ATGGAATCATCTTTCTCATTTGGAGTGATCCTTGCTGTCCTGGCCTCCCTCATCATTGCTACTAACACACTAG 
T^GCTGTGGCTGTGCTGCTGTTGATCCACAAGAATGATGGTGTCAGTCTCTGCTTCACCTTGAATCTGGCTGT 

ggctgacaccttgattggtgtLccatctctggcctactcacagaccagc 

cagaIgaccctgtgcagcctgcggatggcatttgtcacttcctccgcagctgcctctgtcctcacggtcatgc 
tgatcacctttcIcaggtaccttgccatcaagcagcccttccgctacttgaagatcatgagtgggttcgtggc 

rCGGGCCTGCATTGCCGGGCTGTGGTTAGTGTCTTACCTCATTGGCTTCCTCCCAC 

rAGCAGACTGCCTACAAAGLcAGTGCAGCTTCTTTGCTGTATTTCACCCTCACTTCGTGCTGACCCTCTCCT 
rrGTTGGCTTC^CCC^CCATGCTCCTCTTTGTCTTCTTCTACTGCGACATGCTCAAGATTGCCTCCATGCA 

rACTTCAA^GCTCTCCOTACTGTGTCTGTTCTCATTGGGAGCTTTGCTCTATCCTGGACCCCCTTCCTTATCA 

ctggcat^tgcaggtggcctgccaggagtgtcac^ 
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CGTGGGCAACTCCCTGCTCAACCCACTCATCTATGCCTATTGGCAGAAGGAGGTGCGACTGCAGCTCTACCAC 
ATGGCCCTAGGAGTGAAGAAGGTGCTCACCTCATTCCTCCTCTTTCTCTCGGCCAGGAATTGTGGCCCAGAGA 
GGCCCAGGGAAAGTTCCTGTCACATCGTCACTATCTCCAGCTCAGAGTTTGATGGCTAA 

The following amino acid sequence <SEQ ID NO. 74> is the predicted amino 
acid sequence derived from the DNA sequence of SEQ ID NO. 73: 

MESSF SFGVILAVLASLIIATNTLVAVAVL LLIHK3JDGV SLCFTLNLAVADTLIGVAISGLLT DQLSSPSRPT 
QKTLCSLR MAFVTSSAAASVLTVMLITF DRYLAIKQPFRYLKIMSG FVAGACIAGLWLVSYLIGFLPLGIP MF 
QQTAYKGQCSFFAVFHP HFVLTLSCVGFFPAMLLFVFFYCDM LKIA5MHSQQIRKMEHAGAMAGGYRSPRTPS 
DFKAIjRT VSVLIGSFALSWTPFLITGI VQ VACQECHLYLVLER YLWLLGVGNSLLNPLI YAYW QKEVRLQLY 
HMALGVKKVXTSFLLFLSARNCGPERPRESSCHI VTISSSEFDG 



The following DNA sequence TL-GPCR- seq5 <SEQ ID NO. 75> was identified in 

H . sapiens. 

AACTGGAAGGGCAGCCGTCTGCCGCCCACGAACACCTTCTCAAGCACTTTGAGTGACCACGGCTTGCAAGCTG 
GTGGCTGGCCCCCCGAGTCCCGGGCTCTGAGGCACGGCCGTCGACTTAAGCGTTGCATCCTGTTACCTGGAGA 
CCCTCTGAGCTCTCACCTGCTACTTCTGCCGCTGCTTCTGCACAGAGCCCGGGCGAGGACCCCTCCAGGATGC 
AGGTCCCGAACAGCACCGGCCCGGACAACGCGACGCTGCAGATGCTGCGGAACCCGGCGATCGCGGTGGCCCT 
GCCCGTGGTGTACTCGCTGGTGGCGGCGGTCAGCATCCCGGGCAACCTCTTCTCTCTGTGGGTGCTGTGCCGG 
CGCATGGGGCCCAGATCCCCGTCGGTCATCTTCATGATCAACCTGAGCGTCACGGACCTGATGCTGGCCAGCG 
TGTTGCCTTTCCAAATCTACTACCATTGCAACCGCCACCACTGGGTATTCGGGGTGCTGCTTTGCAACGTGGT 
GACCGTGGCCTTTTACGCAAACATGTATTCCAGCATCCTCACCATGACCTGTATCAGCGTGGAGCGCTTCCTG 
GGGGTCCTGTACCCGCTCAGCTCCAAGCGCTGGCGCCGCCGTCGTTACGCGGTGGCCGCGTGTGCAGGGACCT 
GGCTGCTGCTCCTGACCGCCCTGTCCCCGCTGGCGCGCACCGATCTCACCTACCCGGTGCACGCCCTGGGCAT 
CATCACCTGCTTCGACGTCCTCAAGTGGACGATGCTCCCCAGCGTGGCCATGTGGGCCGTGTTCCTCTTCACC 
ATCTTCATCCTGCTGTTCCTCATCCCGTTCGTGATCACCGTGGCTTGTTACACGGCCACCATCCTCAAGCTGT 
TGCGCACGGAGGAGGCGCACGGCCGGGAGCAGCGGAGGCGCGCGGTGGGCCTGGCCGCGGTGGTCTTGCTGGC 
CTTTGTCACCTGCTTCGCCCCCAACAACTTCGTGCTCCTGGCGCACATCGTGAGCCGCCTGTTCTACGGCAAG 
AGCTACTACCACGTGTACAAGCTCACGCTGTGTCTCAGCTGCCTCAACAACTGTCTGGACCCGTTTGTTTATT 
ACTTTGCGTCCCGGGAATTCCAGCTGCGCCTGCGGGAATATTTGGGCTGCCGCCGGGTGCCCAGAGACACCCT 
GGACACGCGCCGCGAGAGCCTCTTCTCCGCCAGGACCACGTCCGTGCGCTCCGAGGCCGGTGCGCACCCTGAA 
GGGATGGAGGGAGCCACCAGGCCCGGCCTCCAGAGGCAGGAGAGTGTGTTCTGAGTCCCGGGGGCGCAGCTTG 
GAGAGCCGGGGGCGCAGCTTGGAGGATCCAGGGGCGCATGGAGAGGCCACGGTGCCAGAGGTTCAGGGAGAAC 
AGCTGCGTTGCTCCCAGGCACTGCAGAGGCCCGGTGGGGAAGGGTCTCCAGGCTTTATTCCTCCCAGGCACTG 
CAGAGGCACCGGTGAGGAAGGGTCTCCAGGCTTCACTCAGGGTAGAGAAACAAGCAAAGCCCAGCAGCGCACA 
GGGTGCTTGTTATCCTGCAGAGGGTGCCTCTGCCTCTCTGTGTCAGGGGACAGCTTGTGTCACCACGCCCGGC 
TAATTTTTGTATTTTTTTTAGTAGAGCTGGGCTGTCACCCCCGAGCTCCTTAGACACTCCTCACACCTGTCCA 
TACCCGAGGATGGATATTCAACCAGCCCCACCGCCTACCCGACTCGGTTTCTGGATATCCTCTGTGGGCGAAC 
TGCGAGCCCCATTCCCAGCTCTTCTCCCTGCTGACATCGTCCCTTAGCACACCTGTCCATACCCGAGGATGGA 
TATTCAACCAGCCCCACCGCCTACCCGACTCGGTTTCTGGATATCCTCTGTGGGCGAACTGCGAGCCCCATTC 
CCAGCTCTTCTCCCTGCTGACATCGTCCCTTAGTTGTGGTTCTGGCCTTCTCCATTCTCCTCCAGGGGTTCTG 
GTCTCCGTAGCCCGGTGCACGCCGAAATTTCTGTTTATTTCACTCAGGGGCACTGTGGTTGCTGTGGTTGGAA 
TTCTTCTTTCAGAGGAGCGCCTGGGGCTCCTGCAAGTCAGCTACTCTCCGTGCCCACTTCCCCTCACACACAC 
ACCCCCCTCGTGCCGAATTC 

The following amino acid sequence <SEQ ID NO. 76> is the predicted amino 
acid sequence derived from the DNA sequence of SEQ ID NO. 75. 

MQVPNSTGPDNATLQMLR NPAIAVALPWYSLVAAVSIPG NL.FSLWVLCRRMGPRSPS VIFMINLSVTDLMLA 
SVLPFQIYYHCNRHHWVFGVLL CNVVTVAFYANMYSSILTMTCISV ERFLGVLYPLSSICRWRRRRY AVAACAG 
TWLLLLT ALS PLARTDLTYPVHADG I ITCFDVLKWTMLPS VAWWA VFLFTI FI LLFL I PFVITVAC YTATI LK 
LLRTEEAHGREQRRR AVGL A AVVLLAFVTCFAPNNFVLLA HI VSRLFYGK S YYHVYKXTLCLSCLNNCLDPFV 
YYFASREFQLRLREYLGCRRVPRDTLDTRRESLFSARTTSVRSEAGAHPEGMEGATRPGLQRQESVF 



The following DNA sequence nGPCR-9 <SEQ ID NO. 77> was identified in H. 
sapi ens : 

ATGGAGTCGGGGCTGCTGCGGCCGGCGCCGGTGAGCGAGGTCATCGTCCTGCATTACAACTACACCGGCAAGC 
TCCGCGGTGCGCGCTACCAGCCGGGTGCCGGCCTGCGCGCCGACGCCGTGGTGTGCCTGGCGGTGTGCGCCTT 
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CATCGTGCTAGAGAATCTAGCCGTGTTGTTGGTGCTCGGACGCCACCCGCGCTTCCACGCTCCCATGTTCCTG 
CTCCTGGGCAGCCTCACGTTGTCGGATCTGCTGGCAGGCGCCGCCTACGCCGCCAACATCCTACTGTCGGGGC 
CGCTCACGCTGAAACTGTCCCCCGCGCTCTGGTTCGCACGGGAGGGAGGCGTCTTCGTGGCACTCACTGCGTC 
CGTGCTGAGCCTCCTGGCCATCGCGCTGGAGCGCAGCCTCACCATGGCGCGCAGGGGGCCCGCGCCCGTCTCC 
AGTCGGGGGCGCACGCTGGCGATGGCAGCCGCGGCCTGGGGCGTGTCGCTGCTCCTCGGGCTCCTGCCAGCGC 
TGGGCTGGAATTGCCTGGGTCGCCTGGACGCTTGCTCCACTGTCTTGCCGCTCTACGCCAAGGCCTACGTGCT 
CTTCTGCGTGCTCGCCTTCGTGGGCATCCTGGCCGCTATCTGTGCACTCTACGCGCGCATCTACTGCCAGGTA 
CGCGCCAACGCGCGGCGCCTGCCGGCACGGCCCGGGACTGCGGGGACCACCTCGACCCGGGCGCGTCGCAAGC 
CGCGCTCGCTGGCCTTGCTGCGCACGCTCAGCGTGGTGCTCCTGGCCTTTGTGGCATGTTGGGGCCCCCTCTT 
CCTGCTGCTGTTGCTCGACGTGGCGTGCCCGGCGCGCACCTGTCCTGTACTCCTGCAGGCCGATCCCTTCCTG 
GGACTGGCCATGGCCAACTCACTTCTGAACCCCATCATCTACACGCTCACCAACCGCGACCTGCGCCACGCGC 
TCCTGCGCCTGGTCTGCTGCGGACGCCACTCCTGCGGCAGAGACCCGAGTGGCTCCCAGCAGTCGGCGAGCGC 
GGCTGAGGCTTCCGGGGGCCTGCGCCGCTGCCTGCCCCCGGGCCTTGATGGGAGCTTCAGCGGCTCGGAGCGC 
TCATCGCCCCAGCGCGACGGGCTGGACACCAGCGGCTCCACAGGCAGCCCCGGTGCACCCACAGCCGCCCGGA 
CTCTGGTATCAGAACCGGCTGCAGACTGA 

The following amino acid sequence <SEQ ID NO. 78> is the predicted amino 
acid sequence derived from the DNA sequence of SEQ ID NO. 77: 

MESGLLRPAPVSEVIVLHYNYTGKLRGARYQPGAGLR ADAWCLAVCAFIVLENLAVLLVL GRHPRFHAP MFL 
LLGSLTLSDLLAGAAYAANILL SGPLTLKLSPALWFAR EGGVFVALTASVL SLLAIALERSLTMARRGPAPVS 
SRGRT LAMAAAAWGVSLLLGLLPAJLG WNCLGRLDACSTVLP LYAKAY^ 

RANARRLPARPGTAGTTSTRARRKPRSLALLRTL. SWLLAFVACWGPLFLLLLLDVACP ARTCPVLLQAD PFL 
GLAMANSLLNPI I YTLT MRDLRHALLRLVCCGRHSCGRDPSGSQQSASAAEASGGLRRCLPPGLDGS FSGSER 
SSPQRDGLDTSGSTGSPGAPTAARTLVSEPAAD 



The following DNA sequence nGPCR-11 <SEQ ID NO. 79> was identified in H. 
sapi ens : 

ATGTACAACGGGTCGTGCTGCCGCATCGAGGGGGACACCATCTCCCAGGTGATGCCGCCGCTGCTCATTGTGG 
CCTTTGTGCTGGGCGCACTAGGCAATGGGGTCGCCCTGTGTGGTTTCTGCTTCCACATGAAGACCTGGAAGCC 
CAGCACTGTTTACCTTTTCAATTTGGCCGTGGCTGATTTCCTCCTTATGATCTGCCTGCCTTTTCGGACAGAC 
TATTACCTCAGACGTAGACACTGGGCTTTTGGGGACATTCCCTGCCGAGTGGGGCTCTTCACGTTGGCCATGA 
ACAGGGCCGGGAGCATCGTGTTCCTTACGGTGGTGGCTGCGGACAGGTATTTCAAAGTGGTCCACCCCCACCA 
CGCGGTGAACACTATCTCCACCCGGGTGGCGGCTGGCATCGTCTGCACCCTGTGGGCCCTGGTCATCCTGGGA 
ACAGTGTATCTTTTGCTGGAGAACCATCTCTGCGTGCAAGAGACGGCCGTCTCCTGTGAGAGCTTCATCATGG 
AGTCGGCCAATGGCTGGCATGACATCATGTTCCAGCTGGAGTTCTTTATGCCCCTCGGCATCATCTTATTTTG 
CTCCTTCAAGATTGTTTGGAGCCTGAGGCGGAGGCAGCAGCTGGCCAGACAGGCTCGGATGAAGAAGGCGACC 
CGGTTCATCATGGTGGTGGCAATTGTGTTCATCACATGCTACCTGCCCAGCGTGTCTGCTAGACTCTATTTCC 
TCTGGACGGTGCCCTCGAGTGCCTGCGATCCCTCTGTCCATGGGGCCCTGCACATAACCCTCAGCTTCACCTA 
CATGAACAGCATGCTGGATCCCCTGGTGTATTATTTTTCAAGCCCCTCCTTTCCCAAATTCTACAACAAGCTC 
AAAATCTGCAGTCTGAAACCCAAGCAGCCAGGACACTCAAAAACACAAAGGCCGGAAGAGATGCCAATTTCGA 
ACCTCGGTCGCAGGAGTTGCATCAGTGTGGCAAATAGTTTCCAAAGCCAGTCTGATGGGCAATGGGATCCCCA 
CATTGTTGAGTGGCACTGA 

The following amino acid sequence <SEQ ID NO. 80> is the predicted amino 
acid sequence derived from the DNA sequence of SEQ ID NO. 79: 

MYNGSCCRIEGDTISQVMPPLL IVAFVLGALGNGVAXCGFCFH MKTWKPS TVYL.FNLAVADFLLMICLPF RTD 
YYLRRRHWAFGDIPCR VGLFTLAMNRAGSIV F LTVVAA DRYFKVVHPHHAVNTI STRVAAGIVCTLWALVILG 
TVYLL LENHLCVQETAVSCESFIMESANGWHD IMFQLEFFMPLG I ILFCSFKI VWSLRRRQQLARQARMKKAT 
RFIMWAIVFITCYLPSVSA RLYFLWTVPSSACDPS VHGAIjHITLSFTYMNSMLDPLVYYF SSPSFPKFYNKL 
KICSLKPKQPGHSKTQRPEEMPISNL.GRRSCIS VANS FQSQSDGQWDPH I VEWH 



The following DNA sequence nGPCR-16 <SEQ ID NO. 81> was identified in H. 
sapiens: 

ATGACAGGTGACTTCCCAAGTATGCCTGGCCACAATACCTCCAGGAATTCCTCTTGCGATCCTATAGACACCC 
CACTTAATCAGCCTCTACTTCATAGTGCTTATTGGCGGGCTGGTGGGTGTCATTTCCATTCTTTTCCTCCTGG 
TGAAAATGAACACCCGGTCAGTGACCACCATGGCGGTCATTAACTTGGTGGTGGTCCACAGCGTTTTTCTGCT 
GACAGTGCCATTTCGCTTGACCTACCTCATCAAGAAGACTTGGATGTTTGGGCTGCCCTTCTGCAAATTTGTG 
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AGTGCCATGCTGCACATCCACATGTACCTCACGTTCCTATTCTATGTGGTGATCCTGGTCACCAGATACCTCA 

TCTTCTTCAAGTGCAAAGACAAAGTGGAATTCTACAGAAAACTGCATGCTGTGGCTGCCAGTGCTGGCATGTG 

GACGCTGGTGATTGTCATTGTGGTACCCCTGGTTGTCTCCCGGTATGGAATCCATGAGGAATACAATGAGGAG 

CACTGTTTTAAATTTCACAAAGAGCTTGCTTACACATATGTGAAAATCATCAACTATATGATAGTCATTTTTG 

TCATAGCCGTTGCTGTGATTCTGTTGGTCTTCCAGGTCTTCATCATTATGTTGATGGTGCAGAAGCTACGCCA 

CTCTTTACTATCCCACCAGGAGTTCTGGGCTCAGCTGAAAAACCTATTTTTTATAGGGGTCATCCTTGTTTGT 

TTCCTTCCCTACCAGTTCTTTAGGATCTATTACTTGAATGTTGTGACGCATTCCAATGCCTGTAACAGCAAGG 

TTGCATTTTATAACGAAATCTTCTTGAGTGTAACAGCAATTAGCTGCTATGATTTGCTTCTCTTTGTCTTTGG 

GGGAAGCCATTGGTTTAAGCAAAAGATAATTGGCTTATGGAATTGTGTTTTGTGCCGTTAGCCACAAACTACA 

GTATTCATATTTGCTTCCTTTATATTGGGAATAAAAATGGGTATAGGGGAGGTAAGAATGGTATTTCATTACT 

TGATCAAAACCATGCCTTGATGTACCCAAAACAAAAGGACTATAAAATGCAAGAGCCCTCATTGTAGTCCTTA 

TGGGATCCCTCCCATCTCTGAGTGATGGCCGTACAAAGACCAGTGTTGTTGAATCCACCTGGAGTTGCAATAT 

TACATTATTTTCCAGTACAGAATGTCTGTGTGGCCCATGAAAGCAACATAGGTTTTAAGAGTTTTAGAGTTTC 

ATTAGCTCATTCTAAGTTCCTCTGTTTGAAGCATGGTCTCTTAGGTTTTGGACTGAACTCAGACCTTTAGTTC 

TTTTCATCCCACTTCACCTTAGGTAAGTAAATTCTGGCCACCACCCAGCTCCAAAGACACAAACTCTCCTTCG 

CTAACCAGGTTAGATGTCCCATTCATCTCATGCCCTGATAAAAACTGATAAGGGGAGAGAATAGTTAAAAATT 

TTTCTAGGGTATCATAACTCTGGTAGGAAGTCATCTGTCTAGAAATCAAGAGAAAAAGAACGTGTGGCCTCCT 

GTTATAACAAGGGTTTCTAGATTTGTCCTGTGAAAGGTCGTTTAAGGACTTGGGGATCAACTTCCTCAATTAT 

CACCAATTGCACTGTTGCTCCAAAAATCATTTAAAAGCTTACTGGACATATCTACATAATGGTGAAACTGTAA 

TTTAGAGACTATCCCTGACTAATGTGCTGGTAGGCATTAAAATGAGTTCCCAAGGGAAGTGATTAAAATTTTT 

TTCTCTTCTGTTTTTTGAGAGAATTTCTAGATGTCCTGGGCCACAGTTAATTAAGATTTTTAGGGGGGACAGA 

AAGTTATACTGAAATCTTTAGAGCTCCCTTCCGCCGTTAAAATTATATATATATATATTTAAATTATACCTTA 

AGTTCTGGGGTACATGTGCAGAATGTGCAGGTTTGTTACATAGGTATACACGTGCCATGGTGGTTTGCGGCAC 

CTGTCAACCCATCTACATTAGGTATTTCTCCTAATGCTCTCCCTCCCCTAGCCCCCCACCCCTGGACAGGCCC 

CATTGTGTGATGTTCCCCTCCCTGTGTCCATGTGTTTTCATTGTTCAACTCCCACTTCTAAGTGAGAACATGC 

GGTGTTTGGTTTTCTGTTCCTGTGTTAGTTTGCTGAGAATGATGGTTTCCAGGTTAAAATTATATATTTTTAA 

ATAAATGAAAACTGTGTTTTTAAAAGAGGACTTTTGAGAAGTATATAGAAAAACCATTAATTTAGACTCTGTG 

AGATTAGGTTGCATGAAGAAGGTTTTCTGAATATTTGAAGAGTGGATAAATAAATGTCCCCCAAAGCAATAAA 

ATCATAATCCTTTAAAATATAGGAAAAATAACTAATGGGAACTAGGCTTAATACTCGGGATGAAATAATCTGT 

ACAACAAACTCCCATGACACATGTTTACCTATGTAACAAACCTGCACATGTACCCCTGAACTTAAAATAAAAT 

TTAAAGTATAATAATAAAATAATATGGATTTTCTTT 

The following amino acid sequence <SEQ ID NO. 82> is the predicted amino 
acid sequence derived from the DNA sequence of SEQ_ ID NO. 81: 

MTGDFPSMPGHNTSRNSSCDPIVTPHL ISLYFIVLIGGLVGVISILFLLV KMNTR SVTTMAVINLVWHSVFL 
LTVPFR LTYLIKKTWMFGLPFCK FVSAMLHIHMYLTFLFywiLV TRYLIFFKCKDKVEFYRKL HAVAASAGM 
WTLVIVIWPL WSRYGIHEEYNEEHCFKFHKELAYTYVK IINYMIVIFVIAVAVILLVFQVFII MLMVQKLR 
HSIjLSHQEFWAQLK NLFFIGVILVCFIjPYQFFRI YYLNVVTHSNACNSKVAFYNE IFLSVTAISCYDLLLFVF 
GG S HW F KQK I I G LWNCVL C R 

The following DNA sequence nGPCR-40 <SEQ ID NO. 83> was identified in H. 
sa.pl ens -. 

GCAGGAGCACTGAAAATCAGGAACAATCCTGTATTTTTTGTGATAATCAACAAGGACAAAACTTCTCCATATG 
TAAATAACAGCGTTATGAGCAGCAATTCATCCCTGCTGGTGGCTGTGCAGCTGTGCTACGCGAACGTGAATGG 
GTCCTGTGTGAAAATCCCCTTCTCGCCGGGATCCCGGGTGATTCTGTACATAGTGTTTGGCTTTGGGGCTGTG 
CTGGCTGTGTTTGGAAACCTCCTGGTGATGATTTCAATCCTCCATTTCAAGCAGCTGCACTCTCCGACCAATT 
TTCTCGTTGCCTCTCTGGCCTGCGCTGATTTCTTGGTGGGTGTGACTGTGATGCCCTTCAGCATGGTCAGGAC 
GGTGGAGAGCTGCTGGTATTTTGGGAGGAGTTTTTGTACTTTCCACACCTGCTGTGATGTGGCATTTTGTTAC 
TCTTCTCTCTTTCACTTGTGCTTCATCTCCATCGACAGGTACATTGCGGTTACTGACCCCCTGGTCTATCCTA 
CCAAGTTCACCGTATCTGTGTCAGGAATTTGCATCAGCGTGTCCTGGATCCTGCCCCTCATGTACAGCGGTGC 
TGTGTTCTACACAGGTGTCTATGACGATGGGCTGGAGGAATTATCTGATGCCCTAAACTGTATAGGAGGTTGT 
CAGACCGTTGTAAATCAAAACTGGGTGTTGACAGATTTTCTATCCTTCTTTATACCTACCTTTATTATGATAA 
TTCTGTATGGTAACATATTTCTTGTGGCTAGACGACAGGCGAAAAAGATAGAAAATACTGGTAGCAAGACAGA 
ATCATCCTCAGAGAGTTACAAAGCCAGAGTGGCCAGGAGAGAGAGAAAAGCAGCTAAAACCCTGGGGGTCACA 
GTGGTAGCATTTATGATTTCATGGTTACCATATAGCATTGATTCATTAATTGATGCCTTTATGGGCTTTATAA 
CCCCTGCCTGTATTTATGAGATTTGCTGTTGGTGTGCTTATTATAACTCAGCCATGAATCCTTTGATTTATGC 
TTTATTTTACCCATGGTTTAGGAAAGCAATAAAAGTTATTGTAACTGGTCAGGTTTTAAAGAACAGTTCAGCA 
ACCATGAATTTGTTTTCTGAACATATATAA 



90 



WO 111/36473 



PCT/IS00/31581 



The following amino acid sequence <SEQ ID NO. 84 > is the predicted amino 
acid sequence derived from the DNA sequence of SEQ ID NO. 83: 

MSSNSSLLVAVQLCYANVNGSCVKI PFSPGSR VILYI VFGFGAVIjAVFGNLLVMIS I LHFKQLH5PTN FLVAS 
LACADFLVGVTVMPFSMV RTVESCWYFGRSFCTFH TCCDVAFCYSSLFHLCFIS I DRYIAVTDPLVYPTK FTV 
S VSG ICI SVSWI LPLMYSG AVFYTGVYDDGLEELSDALNCIGGCQTVVNQNW VLTDFLSFFI PTFIMI ILYGN 
IFLV ARRQAKKIENTGSKTESSSESYKARVARRERKA AKTLGVTWAFMISWLPYSI DSLIDAFMGFIT PACI 
YEICCWCAYYNSAMNPLIY AIiFYPWFRKAIKVIVTGQVLKNSSATMNLFSEHI 



The following DNA sequence nGPCR-54 <SEQ ID NO. 85> was identified in H. 
sapiens : 

ACCATGAATGAGCCACTAGACTATTTAGCAAATGCTTCTGATTTCCCCGATTATGCAGCTGCTTTTGGAAATT 
GCACTGATGAAAACATCCCACTCAAGATGCACTACCTCCCTGTTATTTATGGCATTATCTTCCTCGTGGGATT 
TCCAGGCAATGCAGTAGTGATATCCACTTACATTTTCAAAATGAGACCTTGGAAGAGCAGCACCATCATTATG 
CTGAACCTGGCCTGCACAGATCTGCTGTATCTGACCAGCCTCCCCTTCCTGATTCACTACTATGCCAGTGGCG 
AAAACTGGATCTTTGGAGATTTCATGTGTAAGTTTATCCGCTTCAGCTTCCATTTCAACCTGTATAGCAGCAT 
CCTCTTCCTCACCTGTTTCAGCATCTTCCGCTACTGTGTGATCATTCACCCAATGAGCTGCTTTTCCATTCAC 
AAAACTCGATGTGCAGTTGTAGCCTGTGCTGTGGTGTGGATCATTTCACTGGTAGCTGTCATTCCGATGACCT 
TCTTGATCACATCAACCAACAGGACCAACAGATCAGCCTGTCTCGACCTCACCAGTTCGGATGAACTCAATAC 
TATTAAGTGGTACAACCTGATTTTGACTGCAAGTACTTTCTGCCTCCCCTTGGTGATAGTGACACTTTGCTAT 
ACCACGATTATCCACACTTTGACCCATGGACTGCAAACTGACAGCTGCCTTAAGCAGAAAGCACGAAGGCTAA 
CCATTCTGCTACTCCTTGCATTTTACGTATGTTTTTTACCCTTCCATATCTTGAGGGTCATTCAGGATCGAAT 
CTCAGCCTGCTTTCAATCAGTTGTTCCATTGAGAATCAGATCCATGAAGCTTACATCGTTTCTAGACCATTAT 
GCTGCTCTGAACACCTTTGGTAACCTGTTACTATATGTGGTGGTCAGCGACAACTTTCAGCAGGCTGTCTGCT 
CAACAGTGAGATGCAAAGTAAGCGGGAACCTTGAGCAAGCAAAGAAAATTAGTTACTCAAACAACCCTTGA 

The following amino acid sequence <SEQ ID NO. 86> is the predicted amino 
acid sequence derived from the DNA sequence of SEQ ID NO. 85: 

MNEPLDYLANASDFPDYAAAFGNCTDENIPLiKMHYL PVI YGI I FLVGFPGNAVVI STYI FKMRPWKS STI I ML 
NLACTD L LYLTSLP FLIHYYASGENWI FGDFMCKFIR FSFHFNLYSS ILFLTCFS I FRYCVI IHPMSCFS IHK 
TR CAWACAVWIISLVAVIPMTFLIT STNRTNRSACLDLTSSDELNTIKWYNLILTAST FCLPLVIVTLCYT 
TIIHTLTHGLQTDSCLKQKARR LTILLLLAFYVCFLPFHILRVIQ DRISACFQSWPLRIRSMKLTS FLDHYA 
AL.NTFGNLLLYVVV SDNFQQAVCSTVRCKVSGNLEQAKKIS YSN 



The following DNA sequence nGPCR-56 <SEQ ID NO. 87> was identified in H. 
sapi ens : 

AAAAATTGCTGTACTGAACTATTGAATGGAACTTGGAAATAAAGTCCCTTCCAAAATAACTATTCTTCAACAG 
AGAGTAATAGGTAAATGTTTTAGAAGTGAGAGGACTCAAATTGCCAATGATTTACTCTTTTATTTTTCCTCCT 
AGGTTTCTGGGATAAGTATGTGCAAATAAAAAATAAACATGAGAAGGAACTGTAACCTGATTATGGATTTGGG 
AAAAAGATAAATCAACACACAAAGGGAAAAGTAAACTGATTGACAGCCCTCAGGAATGATGCCCTTTTGCCAC 
AATATAATTAATATTTCCTGTGTGAAAAACAACTGGTCAAATGATGTCCGTGCTTCCCTGTACAGTTTAATGG 
TGCTCATAATTCTGACCACACTCGTTGGCAATCTGATAGTTATTGTTTCTATATCACACTTCAAACAACTTCA 
TACCCCAACAAATTGGCTCATTCATTCCATGGCCACTGTGGACTTTCTTCTGGGGTGTCTGGTCATGCCTTAC 
AGTATGGTGAGATCTGCTGAGCACTGTTGGTATTTTGGAGAAGTCTTCTGTAAAATTCACACAAGCACCGACA 
TTATGCTGAGCTCAGCCTCCATTTTCCATTTGTCTTTCATCTCCATTGACCGCTACTATGCTGTGTGTGATCC 
ACTGAGATATAAAGCCAAGATGAATATCTTGGTTATTTGTGTGATGATCTTCATTAGTTGGAGTGTCCCTGCT 
GTTTTTGCATTTGGAATGATCTTTCTGGAGCTAAACTTCAAAGGCGCTGAAGAGATATATTACAAACATGTTC 
ACTGCAGAGGAGGTTGCTCTGTCTTCTTTAGCAAAATATCTGGGGTACTGACCTTTATGACTTCTTTTTATAT 
ACCTGGATCTATTATGTTATGTGTCTATTACAGAATATATCTTATCGCTAAAGAACAGGCAAGATTAATTAGT 
GATGCCAATCAGAAGCTCCAAATTGGATTGGAAATGAAAAATGGAATTTCACAAAGCAAAGAAAGGAAAGCTG 
TGAAGACATTGGGGATTGTGATGGGAGTTTTCCTAATATGCTGGTGCCCTTTCTTTATCTGTACAGTCATGGA 
CCCTTTTCTTCACTACATTATTCCACCTACTTTGAATGATGTA 

The following amino acid sequence <SEQ ID NO. 88> is the predicted amino 
acid sequence derived from the DNA sequence of SEQ ID NO. 87: 

MMPFCHNI INI SCVKlJNWSNDVRASLY SLriVI-iI I LTTLVGNLI VI VS I SHF KQLHTPTNWL IHSMATVDFLLG 
CLVNPY5MVRSAEHCWYFGEVFCKIHTSTDIMLSSASI FHLSFI S IDRYYAVCDPLRYKAKMNI LVICVMI FI 
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SWSVPAVFAFGMIFL ELNFKGAEEI YYKHVHCRGG CSVFFSKISGVLTFMTSFYI PGS IM LCVY YRI YLIAKE 
QARLISDANQKLQIGLEMKMGISQSKERKAVKTL GI VMGVFLICWCPFFICTA/MDPFL HYI IPPTLNDARGSR 
ANSA 

The following DNA sequence nGPCR-56 <SEQ ID NO. 89> was identified in H. 
sapi ens : 

GGAATGATGCCCTTTTGCCACAATATAATTAATATTTCCTGTGTGAAAAACAACTGGTCAAATGATGTCCGTG 
CTTCCCTGTACAGTTTAATGGTGCTCATAATTCTGACCACACTCGTTGGCAATCTGATAGTTATTGTTTCTAT 
ATCACACTTCAAACAACTTCATACCCCAACAAATTGGCTCATTCATTCCATGGCCACTGTGGACTTTCTTCTG 
GGGTGTCTGGTCATGCCTTACAGTATGGTGAGATCTGCTGAGCACTGTTGGTATTTTGGAGAAGTCTTCTGTA 
AAATTCACACAAGCACCGACATTATGCTGAGCTCAGCCTCCATTTTCCATTTGTCTTTCATCTCCATTGACCG 
CTACTATGCTGTGTGTGATCCACTGAGATATAAAGCCAAGATGAATATCTTGGTTATTTGTGTGATGATCTTC 
ATTAGTTGGAGTGTCCCTGCTGTTTTTGCATTTGGAATGATCTTTCTGGAGCTAAACTTCAAAGGCGCTGAAG 
AGATATATTACAAACATGTTCACTGCAGAGGAGGTTGCTCTGTCTTCTTTAGCAAAATATCTGGGGTACTGAC 
CTTTATGACTTCTTTTTATATACCTGGATCTATTATGTTATGTGTCTATTACAGAATATATCTTATCGCTAAA 
GAACAGGCAAGATTAATTAGTGATGCCAATCAGAAGCTCCAAATTGGATTGGAAATGAAAAATGGAATTTCAC 
AAAGCAAAGAAAGGAAAGCTGTGAAGACATTGGGGATTGTGATGGGAGTTTTCCTAATATGCTGGTGCCCTTT 
CTTTATCTGTACAGTCATGGACCCTTTTCTTCACTACATTATTCCACCTACTTTGAATGATGTATTGATTTGG 
TTTGGCTACTTGAACTCTACATTTAATCCAATGGTTTATGCATTTTTCTATCCTTGGTTTAGAAAAGCACTGA 
AGATGATGCTGTTTGGTAAAATTTTCCAAAAAGATTCATCCAGGTGTAAATTATTTTTGGAATTGAGTTCATA 
G 

The following amino acid sequence <SEQ ID NO. 90> is the predicted amino 
acid sequence derived from the DNA sequence of SEQ ID NO. 89: 

MMPFCHNIINISCVKNNWSNDVRASLY SLMVLIILTTLVGNLIVIVSISHF KQLHTPTN WLIHSMATVDFLLG 
CLVMPY SMVRSAEHCWYFGEVF CKIHTSTDIMLSSAS I FHLSFIS I DRYYAVCDPLRYKAKMN ILVICVMI FI 
SWSVPAVFAFGMI FL ELNFKGAEEI YYKHVHCRGG CSVFFSKI SGVLTFMTSFYIPGS IM LCVYYRI YLIAKE 
QARLISDANQKljQIGLEMKNGISQSKERKAVKTL GIVT^GVFLICWCPFFICTVMDPFL HYI I PPTLND VLIWF 
GYLNSTFNPMV^AFF YPWFRKALKMNLFGKIFQKDSSRCKLFLELSS 



The following DNA sequence nGPCR-58 <SEQ ID NO. 91> was identified in H. 
sapi ens : 

CTGTAAAGTAGATTGTATGAGGACTCCATGAGGTCATCCACTTCAAGTCCTTGGCATAGGATAATTACTCAAA 
AGGTGATGACAATGGCGCAGGGAGGGATGGTGACTTGCCTGGAGATGCACAGCACCGTCTCTCCCATACTCGG 
TCATTCACACCATCATTGATTCACCAGGCACCACTCCGTGTCCAGCAGGACTCTGGGGACCCCAAATGGACAC 
TACCATGGAAGCTGACCTGGGTGCCACTGGCCACAGGCCCCGCACAGAGCTTGATGATGAGGACTCCTACCCC 
CAAGGTGGCTGGGACACGGTCTTCCTGGTGGCCCTGCTGCTCCTTGGGCTGCCAGCCAATGGGTTGATGGCGT 
GGCTGGCCGGCTCCCAGGCCCGGCATGGAGCTGGCACGCGTCTGGCGCTGCTCCTGCTCAGCCTGGCCCTCTC 
TGACTTCTTGTTCCTGGCAGCAGCGGCCTTCCAGATCCTAGAGATCCGGCATGGGGGACACTGGCCGCTGGGG 
ACAGCTGCCTGCCGCTTCTACTACTTCCTATGGGGCGTGTCCTACTCCTCCGGCCTCTTCCTGCTGGCCGCCC 
TCAGCCTCGACCGCTGCCTGCTGGCGCTGTGCCCACACTGGTACCCTGGGCACCGCCCAGTCCGCCTGCCCCT 
CTGGGTCTGCGCCGGTGTCTGGGTGCTGGCCACACTCTTCAGCGTGCCCTGGCTGGTCTTCCCCGAGGCTGCC 
GTCTGGTGGTACGACCTGGTCATCTGCCTGGACTTCTGGGACAGCGAGGAGCTGTCGCTGAGGATGCTGGAGG 
TCCTGGGGGGCTTCCTGCCTTTCCTCCTGCTGCTCGTCTGCCACGTGCTCACCCAGGCCACAGCCTGTCGCAC 
CTGCCACCGCCAACAGCAGCCCGCAGCCTGCCGGGGCTTCGCCCGTGTGGCCAGGACCATTCTGTCAGCCTAT 
GTGGTCCTGAGGCTGCCCTACCAGCTGGCCCAGCTGCTCTACCTGGCCTTCCTGTGGGACGTCTACTCTGGCT 
ACCTGCTCTGGGAGGCCCTGGTCTACTCCGACTACCTGATCCTACTCAACAGCTGCCTCAGCCCCTTCCTCTG 
CCTCATGGCCAGTGCCGACCTCCGGACCCTGCTGCGCTCCGTGCTCTCGTCCTTCGCGGCAGCTCTCTGCGAG 
GAGCGGCCGGGCAGCTTCACGCCCACTGAGCCACAGACCCAGCTAGATTCTGAGGGTCCAACTCTGCCAGAGC 
CGATGGCAGAGGCCCAGTCACAGATGGATCCTGTGGCCCAGCCTCAGGTGAACCCCACACTCCAGCCACGATC 
GGATCCCACAGCTCAGCCACAGCTGAACCCTACGGCCCAGCCACAGTCGGATCCCACAGCCCAGCCACAGCTG 
AACCTCATGGCCCAGCCACAGTCAGATTCTGTGGCCCAGCCACAGGCAGACACTAACGTCCAGACCCCTGCAC 
CTGCTGCCAGTTCTGTGCCCAGTCCCTGTGATGAAGCTTCCCCAACCCCATCCTCGCATCCTACCCCAGGGGC 
CCTTGAGGACCCAGCCACACCTCCTGCCTCTGAAGGAGAAAGCCCCAGCAGCACCCCGCCAGAGGCGGCCCCG 
GGCGCAGGCCCCACGTGAGGGTCCAGGAACACGCAGGCCCACCAGAGCAGTGAAAGAGCCCAGGGCAGACAGA 
GGAACCAGCCAGTCAGA 

The following amino acid sequence <SEQ ID NO. 92> is the predicted amino 
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acid sequence derived from the DNA sequence of SEQ ID NO. 91: 

LAWRCTAPSLPYSVIHTIIDSPGTTPCPAGLWGPQMDTTMEADLGATGHRPRTELDDEDSYPQGGWDTVFLVA 
LLLLGLPANGLMAWL AGS QARHGAGTR LALLLLSLALSPFLFLAAAAF Q I L E I RHGGHWPLGTAACR FYYFLW 
GVSYSSGLFLLAALSL DRCLLALCPHWYPGHRPVRLPL WVCAGVWVLATLFSVPWLVFP EAAVWWYDLVICLD 
FWDSEELS LRMLEVLGGFLPFLLLLVCHVLT QATACRTCHRQQQPAACRGFARVARTILSA YWLRLPYQLAQ 
LLYLAFLW DVYSGYLLWEALVYS DYLILLNSCLSPFLCLMASA DLRTLLRSVLSSFAAALCEERPGSFTPTEP 
QTQLDSEGPTLPEPMAEAQSQMDPVAQPQVNPTLQPRSDPTAQPQLNPTAQPQSDPTAQPQL.NLMAQPOSDSV 
AQPQADTNVQTPAPAASSVPSPCDEASPTPSSHPTPGALEDPATPPASEGESPSSTPPEAAPGAGPT 

The following DNA sequence nGPCR-58 <SEQ ID NO. 93 > was identified in H. 
sapi ens : 

ATGGACACTACCATGGAAGCTGACCTGGGTGCCACTGGCCACAGGCCCCGCACAGAGCTTGATGATGAGGACT 

CCTACCCCCAAGGTGGCTGGGACACGGTCTTCCTGGTGGCCCTGCTGCTCCTTGGGCTGCCAGCCA 

ATGGGTTGATGGCGTGGCTGGCCGGCTCCCAGGCCCGGCATGGAGCTGGCACGCGTCTGGCGCTGCTCCTGCT 

CAGCCTGGCCCTCTCTGACTTCTTGTTCCTGGCAGCAGCGGCCTTCCAGATCCTAGAGATCCGGCATGGGGGA 

CACTGGCCGCTGGGGACAGCTGCCTGCCGCTTCTACTACTTCCTATGGGGCGTGTCCTACTCCTCCGGCCTCT 

TCCTGCTGGCCGCCCTCAGCCTCGACCGCTGCCTGCTGGCGCTGTGCCCACACTGGTACCCTGGGCACCGCCC 

AGTCCGCCTGCCCCTCTGGGTCTGCGCCGGTGTCTGGGTGCTGGCCACACTCTTCAGCGTGCCCTGGCTGGTC 

TTCCCCGAGGCTGCCGTCTGGTGGTACGACCTGGTCATCTGCCTGGACTTCTGGGACAGCGAGGAGCTGTCGC 

TGAGGATGCTGGAGGTCCTGGGGGGCTTCCTGCCTTTCCTCCTGCTGCTCGTCTGCCACGTGCTCACCCAGGC 

CACAGCCTGTCGCACCTGCCACCGCCAACAGCAGCCCGCAGCCTGCCGGGGCTTCGCCCGTGTGGCCAGGACC 

ATTCTGTCAGCCTATGTGGTCCTGAGGCTGCCCTACCAGCTGGCCCAGCTGCTCTACCTGGCCTTCCTGTGGG 

ACGTCTACTCTGGCTACCTGCTCTGGGAGGCCCTGGTCTACTCCGACTACCTGATCCTACTCAACAGCTGCCT 

CAGCCCCTTCCTCTGCCTCATGGCCAGTGCCGACCTCCGGACCCTGCTGCGCTCCGTGCTCTCGTCCTTCGCG 

GCAGCTCTCTGCGAGGAGCGGCCGGGCAGCTTCACGCCCACTGAGCCACAGACCCAGCTAGATTCTGAGGGTC 

CAACTCTGCCAGAGCCGATGGCAGAGGCCCAGTCACAGATGGATCCTGTGGCCCAGCCTCAGGTGAACCCCAC 

ACTCCAGCCACGATCGGATCCCACAGCTCAGCCACAGCTGAACCCTACGGCCCAGCCACAGTCGGATCCCACA 

GCCCAGCCACAGCTGAACCTCATGGCCCAGCCACAGTCAGACTCTGTGGCCCAGCCACAGGCAGACACTAACG 

TCCAGACCCCTGCACCTGCTGCCAGTTCTGTGCCCAGTCCCTGTGATGAAGCTTCCCCAACCCCATCCTCGCA 

TCCTACCCCAGGGGCCCTTGAGGACCCAGCCACACCTCCTGCCTCTGAAGGAGAAAGCCCCAGCAGCACCCCG 

CCAGAGGCGGCCCCGGGCGCAGGCCCCACGTGA 

The following amino acid sequence <SEQ ID NO. 94 > is the predicted amino 
acid sequence derived from the DNA sequence of SEQ ID NO. 93: 

MDTTMEADLGATGHRPRTELDDEDSYPQGGWDT VFLVALLLLGLPANGLMAWL AGSQARHGAGTR LALLLLSL 
ALSDFLFLAAAAF QILEIRHGGHWPLGTAACRFYYF LWGVSYSSGLFLLAALSL DRCLLALCPHWYPGHRPVR 
LPL WVCAGVWVLATLFSVPWLVFP EAAVWWYDLVICLDFWDSEEL.SL RMLEVLGGFLPFLLLLVCHVLT QATA 
CRTCHRQQQPAACRGFARVARTILSA YWLRLPYQLAQLLYLAFLW DVYSGYLLWEALVYS DYLILLNSCIiSP 
FLCLMAS ADLRTLLRSVLSSFAAAX.CEERPGSFTPTEPQTQLDSEGPTLPEPMAEAOSOMDPVAQPQVNPTT.Q 
PRSDPTAQPQLNPTAQPQSDPTAQPQLNLMAQPQSDSVAQPQADTNVQTPAPAA 

The following DNA sequence nGPCR-3 <SEQ ID NO. 185> was identified in H sapiens: 

AGGCTCGCGCCCGAAGCAGAGCCATGAGAACCCCAGGGTGCCTGGCGAGCCGCTAGCGCCATGGGCCCCGGCG 
AGGCGCTGCTGGCGGGTCTCCTGGTGATGGTACTGGCCGTGGCGCTGCTATCCAACGCACTGGTGCTGCTTTG 
TTGCGCCTACAGCGCTGAGCTCCGCACTCGAGCCTCAGGCGTCCTCCTGGTGAATCTGTCTCTGGGCCACCTG 
CTGCTGGCGGCGCTGGACATGCCCTTCACGCTGCTCGGTGTGATGCGCGGGCGGACACCGTCGGCGCCCGGCG 
CATGCCAAGTCATTGGCTTCCTGGACACCTTCCTGGCGTCCAACGCGGCGCTGAGCGTGGCGGCGCTGAGCGC 
AGACCAGTGGCTGGCAGTGGGCTTCCCACTGCGCTACGCCGGACGCCTGCGACCGCGCTATGCCGGCCTGCTG 
CTGGGCTGTGCCTGGGGACAGTCGCTGGCCTTCTCAGGCGCTGCACTTGGCTGCTCGTGGCTTGGCTACAGCA 
GCGCCTTCGCGTCCTGTTCGCTGCGCCTGCCGCCCGAGCCTGAGCGTCCGCGCTTCGCAGCCTTCACCGCCAC 
GCTCCATGCCGTGGGCTTCGTGCTGCCGCTGGCGGTGCTCTGCCTCACCTCGCTCCAGGTGCACCGGGTGGCA 
CGCAGACACTGCCAGCGCATGGACACCGTCACCATGAAGGCGCTCGCGCTGCTCGCCGACCTGCACCCCAGTG 
TGCGGCAGCGCTGCCTCATCCAGCAGAAGCGGCGCCGCCACCGCGCCACCAGGAAGATTGGCATTGCTATTGC 
GACCTTCCTCATCTGCTTTGCCCCGTATGTCATGACCAGGCTGGCGGAGCTCGTGCCCTTCGTCACCGTGAAC 
GCCCAGTGGGGCATCCTCAGCAAGTGCCTGACCTACAGCAAGGCGGTGGCCGACCCGTTCACGTACTCTCTGC 
TCCGCCGGCCGTTCCGCCAAGTCCTGGCCGGCATGGTGCACCGGCTGCTGAAGAGAACCCCGCGCCCAGCATC 
CACCCATGACAGCTCTCTGGATGTGGCCGGCATGGTGCACCAGCTGCTGAAGAGAACCCCGCGCCCAGCGTCC 
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ACCCACAACGGCTCTGTGGACACAGAGAATGATTCCTGCCTGCAGCAGACACACTGAGGGCCTGGCAGGGCTC 
ATCGCCCCCACCTTCTAAGA 

The following amino acid sequence <SEQ ID NO. 186> is the predicted amino acid sequence derived from 
the DNA sequence of SEQ ID NO. 185: 

MGPGEALI ^GLLVMVXAVALLSNAIiVLLCCA YSAE 

SAPGAC QVIGFLDTFLASNAALSVAAIiSA DQWLAVGFPLRYAGRLRPR YAGLLLGCAWGQSLAFSGAALGC SW 
LGYSSAFASCSLRLPPEPERPR FAAFTATLHAVGFVLPIiAVLCLT SLQVHRVARRHCQRMDTVTMKAI^LAD 
LHPS VRQRCL I QQ KRRRHRATR KIGIAIATFLICFAPYVMT RLAELVPFVTVNAQWG I LS K CLTYSKAVADPF 
TYSLL RRPFRQVLAGMVHRLLKRTPRPASTHDSSLDVAGMVHQLLKRTPRPASTHNGSVDTENDSCLOOTH 



EXAMPLE 2: CLONING OF nGPCR-X 

To isolate a cDNA clone encoding full length nGPCR-x, a DNA fragment 
corresponding to a nucleotide sequence set forth in odd numbered nucleotide 
sequences ranging from SEQ ID NO: 1-93, or a portion thereof, can be used as a 
probe for hybridization screening of a phage cDNA library. The DNA fragment is 
amplified by the polymerase chain reaction (PCR) method. The PCR reaction 
mixture of 50 ul contains polymerase mixture (0.2 mM dNTPs, lx PCR Buffer and 
0.75 ul Expand High Fidelity Polymerase (Roche Biochemicals)), 1 ug of plasmid, 
and 50 pmoles of forward primer and 50 pmoles of reverse primer. The primers are 
preferably 10 to 25 nucleotides in length and are determined by procedures well 
known to those skilled in the art. Amplification is performed in an Applied 
Biosystems PE2400 thermocycler, using the following program: 95 U C for 15 seconds, 
52°C for 30 seconds and 72°C for 90 seconds; repeated for 25 cycles. The amplified 
product is separated from the plasmid by agarose gel electrophoresis, and purified by 
Qiaquick™ gel extraction kit (Qiagen). 

A lambda phage library containing cDNAs cloned into lambda ZAPII phage- 
vector is plated with E. coli XL-1 blue host, on 15 cm LB-agar plates at a density of 
50,000 pfu per plate, and grown overnight at 37°C; (plated as described by Sambrook 
et al., supra). Phage plaques are transferred to nylon membranes (Amersham Hybond 
NJ), denatured for 2 minutes in denaturation solution (0.5 M NaOH, 1.5 M NaCl), 
renatured for 5 minutes in renaturation solution (1 M Tris pH 7.5, 1.5 M NaCl), and 
washed bnefly in 2xSSC (20x SSC: 3 M NaCl, 0.3 M Na-citrate). Filter membranes 
are dried and incubated at 80°C for 120 minutes to cross-link the phage DNA to the 
membranes. 

The membranes arc hybridized with a DNA probe prepared as described 
above. A DNA fragment (25 ng) is labeled with a- 32 P-dCTP (NEN) using 
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Rediprime™ random priming (Amersham Pharmacia Biotech), according to 
manufacturers instructions. Labeled DNA is separated from unincorporated 
nucleotides by S200 spin columns (Amersham Pharmacia Biotech), denatured at 95°C 
for 5 minutes and kept on ice. The DNA-containing membranes (above) are pre- 
5 hybridized in 50 ml ExpressHyb™ (Clontech) solution at 68°C for 90 minutes. 

Subsequently, the labeled DNA probe is added to the hybridization solution, and the 
probe is left to hybridize to the membranes at 68°C for 70 minutes. The membranes 
are washed five times in 2x SSC, 0. 1% SDS at 42°C for 5 minutes each, and finally 
washed 30 minutes in 0.1 x SSC, 0.2% SDS. Filters are exposed to Kodak XAR™ 
10 film (Eastman Kodak Company, Rochester, N.Y., USA) with an intensifying screen at 
-80°C for 16 hours. One positive colony is isolated from the plates, and replated with 
about 1000 pfu on a 15 cm LB plate. Plating, plaque lift to filters and hybridization 
are performed as described above. About four positive phage plaques are isolated 
form this secondary screening. 
15 cDNA containing plasmids (pBluescript SK-) are rescued from the isolated 

phages by in vivo excision by culturing XL-1 blue cells co-infected with the isolated 
phages and with the Excision helper phage, as described by manufacturer 
(Stratagene). XL-blue cells containing the plasmids are plated on LB plates and 
grown at 37°C for 16 hours. Colonies (18) from each plate are replated on LB plates 
20 and grown. One colony from each plate is stricken onto a nylon filter in an ordered 
array, and the filter is placed on a LB plate to raise the colonies. The filter is then 
hybridized with a labeled probe as described above. About three positive colonies are 
selected and grown up in LB medium. Plasmid DNA is isolated from the three clones 
by Qiagen Midi Kit™ (Qiagen) according to the manufacturer's instructions. The 
25 size of the insert is determined by digesting the plasmid with the restriction enzymes 
NotI and Sail, which establishes an insert size. The sequence of the entire insert is 
determined by automated sequencing on both strands of the plasmids. 
nGPCR-1: PCR AND SUBCLONING 

cDNAs were sequenced directly using an AB1377 fluorescence-based 
30 sequencer (Perkin Elmer/Applied Biosystems Division, PE/ABD, Foster City, CA) 
and the ABI PRISM Ready Dye-Deoxy Terminator kit with Taq FS polymerase. 
Each ABI cycle sequencing reaction contained about 0.5u.g of plasmid DNA. Cycle- 
sequencing was performed using an initial denaturation at 9S°C for 1 min, followed 
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by 50 cycles; 98°C for 30 sec, annealing at 50°C for 30 sec, and extension at 60°C for 
4 min. Temperature cycles and times were controlled by a Perkin-Elmer 9600 
thermocycler. Extension products were purified using AGTC® gel filtration block 
(Edge BiosSystems, Gaithersburg, MD). Each reaction product was loaded by pipette 
5 onto the column, which was then centrifuged in a swinging bucket centrifuge (Sorvall 
model RT6000B table top centrifuge) at 1500 x g for 4 min at room temperature. 
Column-purified samples were dried under vacuum for about 40 min and then 
dissolved in 5ul of a DNA loading solution (83% deionized formamide, 8.3 mM 
EDTA, and 1 .6 mg/ml Blue Dextran). The samples were then heated to 90°C for 

10 three min and loaded into the gel sample wells for sequence analysis by the ABI377 
sequencer. Sequence analysis was performed by importing ABI373A files into the 
Sequencher program (Gene Codes, Ann Arbor, MI). 

The PCR reaction was performed in 50^iL samples containing 41.9/iL H 2 0, 
SfiL lOx Buffer containing 15 mM MgCl 2 (Boehringer Mannheim Expand High 

15 Fidelity PCR System), 0.5/iL lOmM dNTP mix, 1 .5/xL human genomic DNA 

(Clontech #6550-1, 0.1/*g//xL), 0.3/iL primer VR1 A (\figfnL), 03fiL pnmer VR1B 
( 1/xg/ptL), and 0.5^L High Fidelity Taq polymerase (Boehringer Mannheim, 3.5U//il). 
The primer sequences for and, respectively were: 

5'TCAAAGCTTATGGAATCATCTTTCTCATTTGGAGTGATCCTTGCTGTC, 
20 (VR1 A )(SEQ ID NO: 95) corresponding to the 5' end of the coding region and 
containing a HindlH restriction site, and: 

5 ' TTC ACTCG AGTT AGCCATC AA ACTCTG AGCTGG AG ATAGTG ACG ATGTG 
<VR1B)(SEQ ID NO: 96) corresponding to the 3' end of the coding region and 
containing an Xhol restriction site (Genosys). The PCR reaction was carried out using 
25 a GeneAmp PCR9700 thermocycler (Perkin Elmer Applied Biosystems) and started 
with 1 cycle of 94°C for 2 min followed by 5 cycles at 94°C for 30 sec, 60°C for 2 
min, 72°C for 1.5 min, followed by 20 cycles at 94°C for 30 sec, 60°C for 30 sec, 
72°C for 1.5 min. 

The PCR reaction was loaded onto a 0.75% agarose gel. The DNA band was 
30 excised from the gel and the DNA eluted from the agarose using a QIAquick gel 

extraction kit (Qiagen). The eluted DNA was ethanol-precipitated and resuspended in 
4/zL H 2 0 for ligation. The ligation reaction consisted of 4/xL of fresh ethanol- 
precipitated PCR product and l^tL of pCRII-TOPO vector (Invitrogen). The reaction 
was gently mixed and allowed to incubate for 5 min. at room temperature followed by 
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the addition of \(j.L of 6x TOPO cloning stop solution and mixing for 10 sec. at room 
temperature. The sample was then placed on ice and 2/xL was transformed in 50/xL of 
One Shot cells (Invitrogen) and plated onto ampicillm plates. Four white colonies 
were chosen and the presence of an insert was verified by PCR in the following 
5 manner. Each colony was resuspended in 2 ml LB broth for 2 hrs. A 500xxL aliquot 
was spun down in the microfuge, the supernatant discarded, and the pellet 
resuspended in 25/iL of H2O. A \6fiL aliquot was removed and boiled for 5 min and 
the sample was placed on ice. The sample was microfuged briefly to pellet any 
bacterial debris and PCR was carried out with 15iaL sample using primers VR1A and 

10 VR1B, described above. 

Colonies from positive clones identified by PCR were used to inoculate a 4ml 
culture of LB medium containing 100 iig/ml ampicillin. Plasmid DNA was purified 
using the Wizard Plus Minipreps DNA purification system (Promega). Since the 
primers used to amplify the fragment of nGPCR-1 from genomic DNA were 

15 engineered to have Hindlll and Xliol sites, the cDNA obtained from the minipreps 
was digested with these restriction enzymes. One clone was verified by gel 
electrophoresis to give a DNA band of the correct size. cDNA from this clone was 
then sequenced, yielding the sequence of SEQ ID NO: 73. 
nGPCR-3: PCR AND SUBCLONING 

20 First-strand cDNA synthesis was performed following the directions for 3'- 

RACE ready cDNA from the SMART™ RACE cDNA Amplification Kit (Clontech). 
First 3 \i\ of H 2 0, 1 \l\ human whole brain poly A" RNA (litg/id) (Clontech, 6516-1) 
and 1 fi\ 3'-CDS primer were mixed together, incubated at 70°C for 2 minutes, then 
placed on ice for 2 minutes. Added to the tube was 2 ji\ 5X First-Strand buffer, 1 fx\ 

25 20 mM DTT, 1 /xl dNTP mix (10 mM) and 1 /xl Superscript II RT (200 units//xl) 

(GIBCO/BRL). The tube was incubated at 42°C for 1.5 hours then the reaction was 
diluted with 250 /xl of Tricine-EDTA buffer. 

PCR was performed in a 50 u,l reaction using components that come with the 
Advantage®-GC cDNA PCR Kit. The PCR reaction contained 22.4 ul H 2 0, 10 u.1 

30 5X GC cDNA PCR Reaction buffer, 10 ul 5M GC Melt, lul 50X dNTP mix (L0 mM 
each), 5 ul human brain cDNA, 0.3 ul of LW1649 (SEQ ID NO: 187)(1 ug/fal), 0.3 ul 
ofLW1650(SEQ ID NO: 188)(1 ug^l), 1 ul 50X Advantage-GC cDNA polymerase 
mix. The PCR reaction was performed in a Perkin-Elmer 9600 GeneAmp PCR 
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System starting with 1 cycle of 94°C for 2 min then 8 cycles at 94°C for 15 sec, 72°C 
for 2 min (decreasing 1°C with each cycle), 72°C for 3 min, followed by 30 cycles of 
94°C for 15 sec, 68°C for 3 min. The PCR reaction was loaded onto a 1.2 % agarose 
gel. The DNA band was excised from the gel, placed in GenElute Agarose spin 
5 column (Supelco) and spun for 10 min at maximum speed in a microcentrifuge. The 
eluted DNA was EtOH precipitated and resuspended in 4 H 2 0 for ligation. The PCR 
primer sequence for LW1649 was: 

GCATAAGCTT GCCATGGGCCCCGGCGAGG (SEQ ID NO: 187) 
and for LW 1650 was: 
10 GCATTCTAGA CCTCAGTGTGTCTGCTGC (SEQ ID NO: 188). The 

underlined portion of the primers matches the 5' and 3' areas, respectively, of the 
coding region. 

The ligation reaction used solutions from the TOPO TA Cloning Kit 
(Invitrogen) which consisted of 4u.l PCR product DNA, 1 ^il Salt Solution and 1 p.1 

15 pCRITTOPO vector that was incubated for 5 minutes at room temperature and then 
placed on ice. Two microliters of the ligation reaction was transformed in One-Shot 
TOP 10 cells (Invitrogen), and placed on ice for 30 minutes. The cells were heat- 
shocked for 30 seconds at 42°C, placed on ice for two minutes, 250 fi\ of SOC was 
added, then incubated at 37°C with shaking for one hour and then plated onto 

20 ampicillin plates. A single colony containing an insert was used to inoculate a 5 ml 
culture of LB medium. Plasmid DNA was purified using a Concert Rapid Plasmid 
Miniprep System (GibcoBRL) and then sequenced. 

The DNA subcloned into pCRII-TOPO was sequenced using the ABI 
PRISiM™ 310 Genetic Analyzer (PE Applied Biosystems) which uses advanced 

25 capillary electrophoresis technology and the ABI PRISM™ BigDye™ Terminator 
Cycle Sequencing Ready Reaction Kit. Each cycle-sequencing reaction contained 6 
Hi of H 2 0, 8 ul of BigDye Terminator mix, 5 u,l mini-prep DNA (0.1 u-g/ul), and 1 u.1 
primer (25 ng/ul) and was performed in a Perkin-Elmer 9600 thermocycler with 25 
cycles of 96°C for 10 sec, 50°C for 10 sec, and 60°C for 4 min. The product was 

30 purified using a Centriflex 1M gel filtration cartridge, dried under vacuum, then 

dissolved in 16 ul of Template Suppression Reagent (PE Applied Biosystems). The 
samples were heated at 95°C for 5 min then placed in the 310 Genetic Analyzer, 
yielding the sequence of SEQ ID NO: 95. 
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nGPCR-9: PCR AND SUBCLONING 

The PCR reaction was performed in 50 u.1 containing 34.5 [x\ H 2 0, 5 ul Buffer 
II (PE Applied Biosystems AmpliTaq Gold system), 6 u.1 25 mM MgCl 2 , 2 |il 10 mM 
dNTP mix, 1.5 yi\ human genomic DNA (Clontech #6550-1, 0.1 ng/nl), 0.3 jal primer 
5 VR9A (1 ng/fal), 0.3 |al pnmer VR9B (3 u,g/ul), and 0.4 u.1 AmpliTaq Gold™ DNA 
Polymerase. The primer sequences for VR9A and VR9B were as follows: 

VR9A 5'TTCAAAGCTTATGGAGTCGGGGCTGCTG 3* (SEQ ID NO: 

101) , corresponding to the 5' end of the coding region and containing a Hindi// 
restriction site, and the reverse primer was: 

10 VR9B 5' TTC ACTCG AGTCAGTCTGC AGCCGGTTCTG 3 ' , (SEQ ID NO: 

102) , corresponding to the 3' end of the coding region and containing an Xhol 
restriction site (Genosys). The PCR reaction was carried out using a GeneAmp PCR 
9700 thermocycler (Perkin Elmer Applied Biosystems) and started with 1 cycle of 
95°C for 10 min, then 10 cycles at 95°C for 30 sec, 72°C for 2 min decreasing 1°C 

15 each cycle, 72°C for 1 min, followed by 30 cycles at 95°C for 30 sec, 60°C for 30 sec, 
72°C for 1 min. The PCR reaction was loaded on a 0.75% gel. The DNA band was 
excised from the gel and the DNA was eluted from the agarose using a QIAquick gel 
extraction kit (Qiagen). The eluted DNA was ethanol-precipitated and resuspended in 
4 u,l H 2 0 for ligation. The ligation reaction consisted of 4 u.1 of fresh ethanol- 

20 precipitated PCR product and 1 ul of pCRII-TOPO vector (Invitrogen). The reaction 
was gently mixed and allowed to incubate for 5 min at room temperature followed by 
the addition of 1 u.1 of 6x TOPO cloning stop solution and mixing for 10 sec at room 
temperature. The sample was then placed on ice and 2 u.1 was transformed in 50 \a\ of 
One Shot cells (Invitrogen) and plated onto ampicillin plates. Five white colonies 

25 were chosen and were used to inoculate a 4 ml culture of LB medium containing 100 
u-g/ml ampicillin. Plasmid DNA was purified using the Wizard Plus Minipreps DNA 
purification system (Promega). Since the primers used to PCR SEQ-9 from genomic 
DNA were engineered to have Hindi// and Xhol sites, the cDNA obtained from the 
minipreps was digested with these restriction enzymes. One clone was verified by gel 

30 electrophoresis to give a DNA band of the correct size. cDNA from this clone was 
then submitted for sequencing. One mutation was found (bp 621 T-^G) and repaired 
as described as below. 
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The mutation in the identified clone was repaired using the QuikChange Site- 
Directed Mutagenesis Kit (Stratagene). The PCR reaction contained 39.3 u.1 H 2 0, 5 
u.1 1 Ox reaction buffer, 50 ng mini-prep cDNA, 1 .25 pi primer VR9E ( 1 00 ng/u.1), 
1 .25 u.1 primer VR9F (100 ng/u.1), 1 u.1 20 raM dNTP mix, 1 u.1 Pfu DNA polymerase. 
5 The cycle conditions were 95°C for 30 sec, then 12 cycles at 95°C for 30 sec, 55°C 
for 1 min, 68°C for 10 min. One u.1 of Dpnl was added and the tube incubated at 
37°C for 1 hr. One pi of the DpnI-treated DNA was transformed into 50 pi Epicurian 
coli XLl-Blue supercompetent cells and the entire insert was re-sequenced. The 
primer sequences used were: 
10 VR9E: 5' GCATCCTGGCCGCTATCTGTGCACTCTACG 3' (SEQ ID NO: 



VR9F: 5' CGTAGAGTGCACAGATAGCGGCCAGGATGC 3' (SEQ ID 
NO: 104) where the base underlined was the base being corrected. 

The clone described above was sequenced directly using an ABI377 

15 fluorescence-based sequencer (Perkin Elmer/Applied Biosystems Division, PE/ABD, 
Foster City, CA) and the ABI BigDye™ Terminator Cycle Sequencing Ready 
Reaction kit with Taq FS™ polymerase. Each ABI cycle sequencing reaction 
contained 0.5 pg of plasmid DNA. Cycle-sequencing was performed using an initial 
denaturation at 98°C for 1 min, followed by 50 cycles: 96°C for 30 sec, annealing at 

20 50°C for 30 sec, and extension at 60°C for 4 min. Temperature cycles and times were 
controlled by a Perkin-Elmer 9600 thermocycler. Extension products were purified 
using AGTC (R) gel filtration block (Edge BiosSystems, Gaithersburg, MD). Each 
reaction product was loaded by pipette onto the column, which was then centrifuged 
in a swinging bucket centrifuge (Sorvall model RT6000B tabletop centrifuge) at 1500 

25 x g for 4 min at room temperature. Column-purified samples were dried under 

vacuum for about 40 min and then dissolved in 3 pi of a DNA loading solution (83% 
deionized formamide, 8.3 mM EDTA, and 1.6 mg/ml Blue Dextran). The samples 
were then heated to 90°C for 3.5 min and loaded into the gel sample wells for 
sequence analysis by the ABI377 sequencer. Sequence analysis was performed by 

30 importing ABI377 files into the 3 10 Genetic Analyzer, yielding the sequence of SEQ 
ID NO: 77. 
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nGPCR-1 1 : PCR AND SliBCLONING 

PCR was performed in a 50 ul reaction containing 32 ul H 2 0, 5 jil 10X TT 
buffer (140 mM Ammonium Sulfate, 0.1 % gelatin, 0.6 M Tris-tricine pH 8.4), 5 
15 mM MgS0 4 , 2 jj-1 10 mM dNTP, 5 ul human genomic DNA (0.3ug/ul)(Clontech), 
5 0.3 ulofLW1564(l ng/ul), 0.3 ul of LW1565 (1 ug/ul), 0.4 ul High Fidelity Taq 
polymerase (Boehringer Mannheim). The PCR reaction was performed in a 
GeneAmp 9600 PCR thermocycler (PE Applied Biosystems) starting with 1 cycle of 
94°C for 2 min followed by 17 cycles at 94°C for 30 sec, 72°C for 2 min decreasing 
1 °C each cycle, 68°C for 2 min, then 25 cycles of 94°C for 30 sec, 55°C for 30 sec, 
10 68°C for 2 min. The PCR reaction was loaded onto a 1 .2 % agarose gel. The DNA 
band was excised from the gel, placed in GenElute Agarose spin column (Supelco) 
and spun for 10 min at maximum speed in a microcentrifuge. The eluted DNA was 
EtOH precipitated and resuspended in 4ul H 2 0 for ligation. The forward PCR primer 
sequence was: 

15 LW1564: GCATAAGCTTCCATGTACAACGGGTCGTGCTGC (SEQ ID 

NO: 107), and the reverse PCR primer was: 

LW1565: GCATTCTAGATCAGTGCCACTCAACAATGTGGG (SEQ ID 
NO: 108). 

The ligation reaction used solutions from the TOPO TA Cloning Kit 
20 (Invitrogen) which consisted of 4 ul PCR product DNA and 1 ul pCRII-TOPO vector 
that was incubated for 5 minutes at room temperature. To the ligation reaction one 
microliter of 6X TOPO Cloning Stop Solution was added then the reaction was placed 
on ice. Two microliters of the ligation reaction was transformed in One Shot TOP 10 
cells (Invitrogen), and placed on ice for 30 minutes. The cells were heat-shocked for 
25 30 seconds at 42°C, placed on ice for two minutes, 250 Tl of SOC was added, then 

incubated at 37°C with shaking for one hour and then plated onto ampicillin plates. A 
single colony containing an insert was used to inoculate a 5 ml culture of LB medium. 
Plasmid DNA was purified using a Concert Rapid Plasmid Miniprep System 
(GibcoBRL) and then sequenced. 
30 The DNA subcloned into pCRII was sequenced using the ABI PRISM™ 310 

Genetic Analyzer (PE Applied Biosystems) which uses advanced capillary 
electrophoresis technology and the ABI PRISM IM BigDye™ Terminator Cycle 
Sequencing Ready Reaction Kit. Each cycle-sequencing reaction contained 6 ul of 
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H 2 0, 8 p.1 of BigDye Terminator mix, 5 ^1 mini-prep DNA (0.1 ng/j-il), and 1 \i\ 
primer (25 ng/u.1) and was performed in a Perkin-Elmer 9600 thermocycler with 25 
cycles of 96°C for 10 sec, 50°C for 10 sec, and 60°C for 4 min. The product was 
purified using a Centriflex™ gel filtration cartridge, dried under vacuum, then 
5 dissolved in 16 ul of Template Suppression Reagent (PE Applied Biosystems). The 
samples were heated at 95°C for 5 min then placed in the 310 Genetic Analyzer, 
yielding the sequence of SEQ ID NO: 79. 
nGPCR-16: PCR AND SUBCLONING 

PCR was performed in a 50 ul reaction containing 32 ul H 2 0, 5 pi 10X TT 

10 buffer (140 mM Ammonium Sulfate, 0.1 % gelatin, 0.6 M Tris-tricine pH 8.4), 5 \il 
15 mM MgS0 4 , 2 ul lOmM dNTP, 5 pi 2445704H1 DNA (0.17 Tg/Tl), 0.3 ul of 
LW1587 (1 ug/u-1), 0.3 pi of LW 1588 (1 ug/uJ), 0.4 ul High Fidelity Taq polymerase 
(Boehringer Mannheim). The PCR reaction was performed on a Robocycler 
thermocycler (Stratagene) starting with 1 cycle of 94°C for 2 min followed by 1 5 

15 cycles of 94°C for 30 sec, 55°C for 1.3 min, 68°C for 2 min. The PCR reaction was 
loaded onto a 1 .2 % agarose gel. The DNA band was excised from the gel, placed in 
GenElute Agarose spin column (Supelco) and spun for 10 min at maximum speed in a 
microcentrifuge. The eluted DNA was EtOH precipitated and resuspended in 12ul 
H2O for ligation. The PCR primer sequence for the forward primer was: 

20 LW1587: GATCAAGCTTATGACAGGTGACTTCCCAAGTATGC (SEQ 

ID NO: 111), and the sequence for the reverse primer was: 

LW1588: GATCCTCGAGGCTAACGGCACAAAACACAATTCC (SEQ ID 
NO: 112). 

The ligation reaction used solutions from the TOPO TA Cloning Kit 
25 (Invitrogen) which consisted of 4ul PCR product DNA and 1 pi pCRII-TOPO vector 
that was incubated for 5 minutes at room temperature. To the ligation reaction one 
microliter of 6X TOPO Cloning Stop Solution was added then the reaction was placed 
on ice. Two microliters of the ligation reaction was transformed in One-Shot TOP10 
cells (Invitrogen), and placed on ice for 30 minutes. The cells were heat-shocked for 
30 30 seconds at 42°C, placed on ice for two minutes, 250 pi of SOC was added, then 

incubated at 37°C with shaking for one hour and then plated onto ampicillin plates. A 
single colony containing an insert was used to inoculate a 5 ml culture of LB medium. 
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Plasmid DNA was purified using a Concert Rapid Plasmid Miniprep System 
(GibcoBRL) and then sequenced. 

The DNA subcloned into pCRIJ was sequenced using the ABI PRISM™ 310 
Genetic Analyzer (PE Applied Biosystems) which uses advanced capillary 
electrophoresis technology and the ABI PRISM™ BigDye™ Terminator Cycle 
Sequencing Ready Reaction Kit. Each cycle-sequencing reaction contained 6 u.1 of 
H 2 0, 8 u.1 of BigDye Terminator mix, 5 |il mini-prep DNA (0.1 Mg/u.1), and 1 u.1 
primer (25 ng/u.1) and was performed in a Perkin-Elmer 9600 thermocycler with 25 
cycles of 96°C for 10 sec, 50°C for 10 sec, and 60°C for 4 min. The product was 
purified using a Centriflex™ gel filtration cartridge, dried under vacuum, then 
dissolved in 16 ul of Template Suppression Reagent (PE Applied Biosystems). The 
samples were heated at 95°C for 5 min then placed in the 310 Genetic Analyzer, 
yielding the sequence of SEQ ID NO: 81. 

nGPCR-40: PCR AND SUBCLONING 

PCR was performed in a 50 ul reaction containing utilizing Herculase DNA 

Polymerase blend (Stratagene), using the buffer recommendations provided by the 
manufacturer, 200 ng each of primers PSK 18 and 19 (SEQ ID NOS: 115 and 1 16), 
1 50 ng of human genomic DNA (Clontech), and 2% DMSO. The PCR reaction was 
performed on a Robocycler thermocycler (Stratagene) starting with 1 cycle of 94°C 
for 2 min followed by 35 cycles of 94°C for 30 sec, 65°C for 30 sec, 72°C for 2 min. 
The PCR reaction was purified using the QiaQuick PCR Purification Kit (Qiagen), 
and then eluted in TE. The PCR primer sequences were: 

PSK 18 GATC GAATTCGCAGGAGCAATG AAAATCAGGAAC (SEQ ID 
NO: 115), and: 

PSK19: GATCGAATTC TTATATATGTTCAGAAAACAAATTCATGG 
(SEQ ID NO: 1 16)). The underlined portion of the primer matches the 5' and 3' areas, 
respectively, of a portion of the 5' untranslated region and coding region. Initiation 
and termination codons are shown above in bold. 

The PCR product was ligated into the pCR-Bluntll-TOPO vector (Invitrogen) 
using the Zero Blunt Topo PCR TA cloning kit as follow: 3 u.1 PCR product DNA, 1 
ul pCRII-TOPO vector, and 1 ul TOPOII salt solution ( 1 .2M NaCl, 0.06M MgCl 2 ). 
The mixture was incubated for 5 minutes at room temperature. To the ligation 
reaction one microliter of 6X TOPO Cloning Stop Solution was added, and then the 
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reaction was placed on ice. Two microliters of the ligation reaction was transformed 
in One-Shot TOP 10 cells (Invitrogen), and placed on ice for 30 minutes. The cells 
were heat-shocked for 30 seconds at 42°C, placed on ice for two minutes, 250 /xl of 
SOC was added, then incubated at 37°C with shaking for one hour and then plated 
5 onto ampicillin plates supplemented with Xgal and 1PTG. Single colonies were 
screened by PCR for the presence of the insert, and a plasmid DNA from colony 58 
was purified using a Qiagen Endo-Free plasmid purification kit. 

nGPCR-40 was sequenced directly using an ABI377 fluorescence-based 
sequencer (Perkin Elmer/ Applied Biosystems Division, PE/ABD, Foster City, CA) 

10 and the ABI BigDye™ Terminator Cycle Sequencing Ready Reaction kit with Taq 
FS™ polymerase. Each ABI cycle sequencing reaction contained about 0.5 fig of 
plasmid DNA. Cycle-sequencing was performed using an initial denaturation at 98°C 
for 1 min, followed by 50 cycles: 96°C for 30 sec, annealing at 50°C for 30 sec, and 
extension at 60°C for 4 min. Temperature cycles and times were controlled by a 

15 Perkin-Elmer 9600 thermocycler. Extension products were purified using AGTC® 
gel filtration block (Edge BiosSystems, Gaithersburg, MD). Each reaction product 
was loaded by pipette onto the column, which was then centrifuged in a swinging 
bucket centrifuge (Sorvall model RT6000B tabletop centrifuge) at 1500 x g for 4 min 
at room temperature. Column-purified samples were dried under vacuum for about 

20 40 min and then dissolved in 3 u.1 of DNA loading solution (83% deionized 

formamide, 8.3 mM EDTA, and 1.6 mg/ml Blue Dextran). The samples were then 
heated to 90°C for 3.5 min and loaded into the gel sample wells for sequence analysis 
by the ABI377 sequencer. Sequence analysis was performed by importing ABI377 
files into the Sequencher program (Gene Codes, Ann Arbor, MI), which yielded a 

25 sequence identical to SEQ ID NO:83 with the exception that the nucleotide at position 
10 was identified as an "A" which incorrectly indicated the presence of an initiation 
codon at that position. Subsequent analysis of genomic DNA samples indicated that 
this position was incorrectly assigned and that the correct 

nucleotide at that position was a "C". The sequence reported at SEQ ID NO. 83 
30 correctly identifies the nucleotide at position 10 and indicates that the first initiation 
codon occurs at position 88-90. 
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nGPCR-54: PCR AND SUBCLONING 

Two microliters of a human genomic library (-10 8 PFU/ml) (Clontech) was 
added to 6 ml of an overnight culture of K802 cells (Clontech), then distributed as 250 
u.1 aliquots into each of 24 tubes. The tubes were incubated at 37°C for 15 min. 

5 Seven milliliters of 0.8% agarose was added to each tube, mixed, then poured onto 
LB agar + 10 mM MgS0 4 plates and incubated overnight at 37°C To each plate 5 ml 
of SM (0.1M NaCl, 8.1 mM MgS0 4 -7H 2 0, 50mM Tris-Cl (pH 7.5), 0.0001% gelatin) 
phage buffer was added and the top agarose was removed with a microscope slide and 
placed in a 50 ml centrifuge tube. A drop of chloroform was added and the tube was 

10 place in a 37 °C shaker for 1 5 min, then centrifuged for 20 mm at 4000 RPM (Sorvall 
RT6000 table top centrifuge) and the supernatant stored at 4°C as a stock solution. 

Two p.1 of phage from each tube was heated to 99°C for 4 min then cooled to 
10°C. Added to the phage was a PCR mix containing 8.8 ul H 2 0, 4 ul 5X Rapid- 
Load Buffer (Origene), 2 u1 lOxPCR buffer II (Perkin-Elmer), 2 ul 25 mM MgCl 2 , 

15 0.8 ul 10 mM dNTP, 0.12 ul LW1634 (1 ug/ul)(SEQ ID NO: 1 19), 0.12 ul LW1635 
(1 ug/ul)(SEQ ID NO: 120), 0.2 u.1 AmpliTaq Gold polymerase (Perkin Elmer). The 
PCR reaction involved 1 cycle at 95°C for 10 min followed by 35 cycles at 95°C for 
45 sec, 53.5°C for 2 min, 72°C for 45 sec. The reaction was loaded onto a 2 % 
agarose gel. From the tube that gave a PCR product of the correct size, 10 u.1 was 

20 used to set up five 1:10 dilutions that were plated onto LB agar + 10 mM MgS0 4 

plates and incubated overnight. A BA85 nitrocellulose filter (Schleicher 8c Schuell) 
was placed on top of each plate for 1 hour. The filter was removed, placed phage side 
up in a petri dish, and covered with 4 ml of SM for 1 5 min to elute the phage. One 
milliliter of SM was removed from each plate and used to set up a PCR reaction as 

25 above. The plate of the lowest dilution to give a PCR product was subdivided, filter- 
lifted and the PCR reaction was repeated. The series of dilutions and subdividing of 
the plate was continued until a single plaque was isolated that gave a positive PCR 
band. Once a single plaque was isolated, 10 ul phage supernatant was added to 100 
ul SM and 200 yi\ of K802 cells per plate with a total of 8 plates set up. The plates 

30 were incubated overnight at 37°C. The top agarose was removed by adding 8 ml of 
SM then scrapping off the agarose with a microscope slide and collected in a 
centrifuge tube. To the lube, 3 drops of chloroform was added, vortexed, incubated at 
37°C for 1 5 min then centrifuged for 20 min at 4000 RPM (Sorvall RT6000 table top 
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centrifuge) to recover the phage, which was used to isolate genomic phage DNA 
using the Qiagen Lambda Midi Kit. The sequence for primer LW1634 was: 



CTGAAAGTTGTCGCTGACC (SEQ ID NO: 119), and for primer LW1635 



CGATTATCCACACTTTGACCC (SEQ ID NO: 120). 

The PCR reaction for the coding region was performed in a 50 ul reaction 
containing 33 ul H 2 0, 5 ul 10X TT buffer (140 mM Ammonium Sulfate, 0.1 % 
gelatin, 0.6 M Tris-tricine pH 8.4), 5 ul 15 mM MgS0 4l 2 ul 10 mM dNTP, 4 ul 
genomic phage DNA (0.25 ug/ul), 0.3 ul LW1698 (1 ug/uI)(SEQ ID NO: 121), 0.3 ul 
LW1699 (1 ug/ul)(SEQ ID NO: 122), 0.4 ul High Fidelity Taq polymerase 
(Boehringer Mannheim). The PCR reaction was started with 1 cycle of 94°C for 2 
mm followed by 30 cycles at 94°C for 30 sec, 55°C for 30 sec, 68°C for 2 min. The 
PCR reaction was loaded onto a 2 % agarose gel. The DNA band was excised from 
the gel, placed in GenElute Agarose spin column (Supelco) and spun for 10 min at 
maximum speed. The eluted DNA was EtOH precipitated and resuspended in 8ul 
H2O. The PCR primer sequence for primer LW1698 was: 

GCATACC ATGAATGAGCCACTAGAC (SEQ ID NO: 121), and for primer 
LW 1699 was: 

GCATCTCGAG TCAAGGGTTGTTTGAGTAAC (SEQ ID NO: 122). The 
underlined portion of the primer matches the 5' and 3' areas, respectively, of the 
coding region of nGPCR-54. 

The ligation reaction used solutions from the TOPO TA Cloning Kit 
(Invitrogen) which consisted of 4ul PCR product DNA, 1 pJ of salt solution and 1 ul 
pCRIl-TOPO vector that was incubated for 5 minutes at room temperature then the 
reaction was placed on ice. Two microliters of the ligation reaction was transformed 
in One-Shot TOP 10 cells (Invitrogen), and placed on ice for 30 minutes. The cells 
were heat-shocked for 30 seconds at 42°C, placed on ice for two minutes, 250 u.1 of 
SOC was added, then incubated at 37°C with shaking for one hour and then plated 
onto ampicillin plates. A single colony containing an insert was used to inoculate a 5 
ml culture of LB medium. Plasmid DNA was purified using a Concert Rapid Plasmid 
Miniprep System (GibcoBRL) and then sequenced. 

nGPCR-54 genomic phage DNA was sequenced using the ABI PRISM IM 310 
Genetic Analyzer (PE Applied Biosystems) which uses advanced capillary 
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electrophoresis technology and the ABI PRISM™ BigDye™ Terminator Cycle 
Sequencing Ready Reaction Kit. The cycle-sequencing reaction contained 14 jal of 
H 2 0, 16 \i\ of BigDye Terminator mix, 7 genomic phage DNA (0.1 u.g/nl), and 3 \x\ 
primer (25 ng/ul). The reaction was performed in a Perkin-Elmer 9600 thermocycler 
5 at 95°C for 5 min, followed by 99 cycles of 95°C for 30 sec, 55°C for 20 sec, and 
60°C for 4 min. The product was purified using a Centriflex™ gel filtration 
cartridges, dried under vacuum, then dissolved in 16 ul of Template Suppression 
Reagent. The samples were heated at 95°C for 5 min then placed in the 31 0 Genetic 
Analyzer. 

10 The DNA subcloned into pCRII was sequenced using the ABI PRISM™ 310 

Genetic Analyzer (PE Applied Biosystems) which uses advanced capillary 
electrophoresis technology and the ABI PRISM™ BigDye™ Terminator Cycle 
Sequencing Ready Reaction Kit. Each cycle-sequencing reaction contained 6 yi\ of 
H 2 0, 8 ul of BigDye Terminator mix, 5 u.1 mini-prep DNA (0. 1 ug/u.1), and 1 u.1 

15 primer (25 ng/ul) and was performed in a Perkin-Elmer 9600 thermocycler with 25 
cycles of 96°C for 10 sec, 50°C for 10 sec, and 60°C for 4 min. The product was 
purified using a Centriflex™ gel filtration cartridge, dried under vacuum, then 
dissolved in 16 ul of Template Suppression Reagent (PE Applied Biosystems). The 
samples were heated at 95°C for 5 min then placed in the 310 Genetic Analyzer, 

20 yielding the sequence of SEQ ID NO: 85. 
nGPCR-56: PCR AND SUBCLONING 

The PCR reaction for the coding region of nGPCR-56 used components that 
come with PLATINUM® Pfx DNA Polymerase (GibcoBRL) containing 35.5 \x\ H 2 0, 
5 ul 10X Pfx Amplification buffer, 1.5 ul 50mM MgS0 4 , 2 ul 10 mM dNTP, 5 ul 

25 human genomic DNA (0.3ug/ul)(Clontech), 0.3 ul of LW1603 (1 (ig/^I)(SEQ ID NO: 
152), 0.3 ulof LW1604 (I ug/ul)(SEQ ID NO: 153), 0.4 ul PLATINUM® Pfx DNA 
Polymerase (2.5 U/Tl). The PCR reaction was performed in a Robocycler Gradient 
96 (Stratagene) starting with 1 cycle of 94°C for 5 min followed by 30 cycles at 94°C 
for 40 sec, 55°C for 2 min, 6S°C for 3 min. Following the final cycle, 0.5 ul of 

30 AmpliTaq DNA Polymerase (5 U/ul) was added and the tube was incubated at 72°C 
for 5 min. The sequence of LW1 603 is: 
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GATCAAGCTTGGA ATGATGCCCTTTTGCCAC (SEQ ID NO: 152), and 
for LW1604 is: 

GATCCTCGAGCA TCATTCAAAGTAGGTGG. (SEQ ED NO: 153). The 
underlined portion of the primer matches the 5' and 3' areas, respectively, of a portion 
of the coding region of nGPCR-56. 

The PCR reaction for the coding region was performed in a 50 ul reaction 
containing 32 ul H 2 0, 5 ul 10X TT buffer (140 mM Ammonium Sulfate, 0.1 % 
gelatin, 0.6 M Tris-tricine pH 8.4), 5 ul 15 mM MgS0 4 , 2 uJ 10 mM dNTP, 5 ul 
human genomic DNA (0.3ug/ul)(Clontech), 0.3 ul LW1603 (1 ug/ul)(SEQ ID NO: 
152), 0.3 ul LW1696 (1 ug/nl)(SEQ ID NO: 154), 0.4 ul High Fidelity Taq 
polymerase (Boehringer Mannheim). The PCR reaction was started with 1 cycle of 
94°C for 2 min followed by 25 cycles at 94°C for 40 sec, 55°C for 60 sec, 68°C for 2 
min. The PCR reaction was loaded onto a 2 % agarose gel. The DNA band was 
excised from the gel, placed in GenElute Agarose spin column (Supelco) and spun for 
10 min at maximum speed. The eluted DNA was EtOH precipitated and resuspended 
in 1 2ul H 2 0 for ligation. The PCR primer sequence for L W 1 603 is: 

GATCAAGCTTGGA ATGATGCCCTTTTGCCAC (SEQ ID NO: 152), and 
LW1696: 

GATCCTCGAG CTATGAACTCAATTCCAAAA ATAATTTACACC (SEQ 
ID NO: 154). The underlined portion of the primer matches the 5* and 3' areas, 
respectively, of a portion of the coding region. 

The ligation reaction used solutions from the TOPO TA Cloning Kit 
(Invitrogen) which consisted of 4ul PCR product DNA, 1 ul of salt solution and 1 ul 
pCRII-TOPO vector that was incubated for 5 minutes at room temperature then the 
reaction was placed on ice. Two microliters of the ligation reaction was transformed 
in One-Shot TOP 10 cells (Invitrogen), and placed on ice for 30 minutes. The cells 
were heat-shocked for 30 seconds at 42°C, placed on ice for two minutes, 250 ul of 
SOC was added, then incubated at 37°C with shaking for one hour and then plated 
onto ampicillin plates. A single colony containing an insert was used to inoculate a 5 
ml culture of LB medium. Plasmid DNA was purified using a Concert Rapid Plasmid 
Miniprep System (GibcoBRL) and then sequenced. 

The mutation in nGPCR-56 was repaired using the QuikChange Site-Directed 
Mutagenesis Kit (Stratagene). The PCR reaction contained 40 ul H20, 5 ^1 lOx 
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Reaction buffer, 1 pi mini-prep DNA, 1 ul LW1700 (125 ng/pl) (SEQ ID NO: 155), 
lplLW1701 (125 ng/pl) (SEQ ID NO: 156), lpl 10 mM dNTP, 1 pi Pfu DNA 
polymerase. The cycle conditions were 95°C for 30 sec then 14 cycles at 95°C for 30 
sec, 55°C for 1 min, 68°C for 12 min. The tube was placed on ice for 2 min, then 1 pi 
5 of Dpnl was added and the tube incubated at 37°C for one hour. One microliter of the 
Z)p«/-treated DNA was transformed into Epicurian coli XL 1 -Blue supercompetent 
cells and the entire insert was re-sequenced. The primer sequences are: 
GCTACTTGAACTCTACATTTAATCCAATGGTTTATGCATTTTTCTATCC 
(LWI700)(SEQ ID NO: 155), and: 

1 0 GG AT AG AAAAATGC ATAA ACC ATTGG ATT AAATGT AG AGTTC AAGTAGC 
(LW1701)(SEQ ID NO: 156). 

The DNA subcloned into pCRII was sequenced using the ABI PRISM™ 310 
Genetic Analyzer (PE Applied Biosystems) which uses advanced capillary 
electrophoresis technology and the ABI PRISM™ BigDye™ Terminator Cycle 

15 Sequencing Ready Reaction Kit. Each cycle-sequencing reaction contained 6 pi of 
H 2 0, 8 p] of BigDye Terminator mix, 5 pi mini-prep DNA (0. 1 pg/pl), and 1 pi 
primer (25 ng/pl) and was performed in a Perkin-Elmer 9600 thermocycler with 25 
cycles of 96°C for 10 sec, 50°C for 10 sec, and 60°C for 4 min. The product was 
purified using a Centriflex™ gel filtration cartridge, dried under vacuum, then 

20 dissolved in 16 pi of Template Suppression Reagent (PE Applied Biosystems). The 
samples were heated at 95°C for 5 min then placed in the 310 Genetic Analyzer, 
yielding the sequence of SEQ ID NO: 89. 
nGPCR-58: PCR AND SUBCLONING 

25 Isolation of a clone for nGPCR-58 from genomic DNA was performed by 

PCR in a 50 pi reaction containing Herculase DNA Polymerse blend (Stratagene), 
with buffer recommendations as supplied by the manufacturer, 200 ng each primers 
PSK14 (SEQ ID NO: 157) and PSK15 (SEQ ID NO: 158), 150 ng of human genomic 
DNA (Clontech) and 6% DMSO. The PCR reaction was performed on a Robocycler 

30 thermocycler (Stratagene) starting with 1 cycle of 94°C for 2 min followed by 35 
cycles of 94°C for 30 sec, 65°C for 30 sec, 72°C for 2 min. The PCR reaction was 
purified by the QiaQuick PCR Purification Kit (Qiagen) and eluted in TE. The PCR 
primer sequences were: 
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PSKI4: 5'GATCGAATTC ATGGACACTACCATGGAAGCTGACC (SEQ 
ID NO: 157), and: 

PSK15: 5 f GATCCTCGAG TCACGTGGGGCCTGCGCCCGG (SEP ID NO: 

158). 

The underlined portion of the primers match the 5' and 3' areas, respectively, 
of a portion of the 5' untranslated region and coding region. Translation initiation and 
termination codons are shown above in bold. 

The blunt ended PCR product was prepared for cloning by the addition of a 
single base "A" residue by AmpliTaq Gold (Perkin Elmer) in a reaction with IX PCR 
Buffer II, 1 mM MgCl 2 , 200uM each dATP, dGTP, dCTP, and dTTP. The reaction 
was incubated at 94°C for 10 minutes followed by 72°C for 10 minutes. The products 
were cloned into the pCRlI-TOPO vector (Invitrogen) using the TOPO TA cloning kit 
as follows: 3ul PCR product DNA , 1 ul pCRII-TOPO vector, and 1 ul TOPOII salt 
solution (1.2M NaCl, 0.06M MgCl 2 ) was incubated for 5 minutes at room 
temperature. To the ligation reaction one microliter of 6X TOPO Cloning Stop 
Solution was added then the reaction was placed on ice. Two microliters of the 
ligation reaction was transformed in One-Shot TOP 10 cells (Invitrogen), and placed 
on ice for 30 minutes. The cells were heat-shocked for 30 seconds at 42°C, placed on 
ice for two minutes, 250 pi of SOC was added, then incubated at 37°C with shaking 
for one hour and then plated onto ampicillin plates supplemented with X-gal and 
IPTG. Single colonies were screened by PCR for the presence of the insert, and a 
plasmid DNA from colony 58-6 was purified using a Qiagen Endo-Free plasmid 
purification kit and deposited as nGPCR-58. 

nGPCR-58 was sequenced directly using an ABI377 fluorescence-based 
sequencer (Perkin Elmer/Applied Biosystems Division, PE/ABD, Foster City, CA) 
and the ABT BigDye™ Terminator Cycle Sequencing Ready Reaction kit with Taq 
FS ,M polymerase. Each AB1 cycle sequencing reaction contained about 0.5 ug of 
plasmid DNA. Cycle-sequencing was performed using an initial denaturation at 98°C 
for 1 min, followed by 50 cycles: 96°C for 30 sec, annealing at 50°C for 30 sec, and 
extension at 60°C for 4 min. Temperature cycles and times were controlled by a 
Perkin-Elmer 9600 thermocycler. Extension products were purified using AGTC (R) 
gel filtration block (Edge BiosSystems, Gaithersburg, MD). Each reaction product 
was loaded by pipette onto the column, which was then centrifuged in a swinging 
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bucket centrifuge (Sorvall model RT6000B tabletop centnfuge) at 1500 x g for 4 min 
at room temperature. Column-purified samples were dried under vacuum for about 
40 min and then dissolved in 3 ul of a DNA loading solution (83% deionized 
formamide, 8.3 mM EDTA, and 1.6 mg/ml Blue Dextran). The samples were then 
heated to 90°C for 3.5 min and loaded into the gel sample wells for sequence analysis 
by the ABI377 sequencer. Sequence analysis was performed by importing ABI377 
files into the Sequencer program (Gene Codes, Ann Arbor, MI), yielding the sequence 
of SEQ ID NO: 93. 

EXAMPLE 3: HYBRIDIZATION ANALYSIS TO DEMONSTRATE nGPCR-X 
EXPRESSION IN BRAIN 

The expression of nGPCR-x in mammals, such as the rat, may be investigated 
by in situ hybridization histochemistry. To investigate expression in the brain, for 
example, coronal and sagittal rat brain cryosections (20 /xm thick) are prepared using 
a Reichert-Jung cryostat. Individual sections are thaw-mounted onto silanized, 
nuclease-free slides (CEL Associates, inc., Houston, TX), and stored at -80°C. 
Sections are processed starting with post-fixation in cold 4% paraformaldehyde, 
nnsed in cold phosphate-buffered saline (PBS), acetylated using acetic anhydride in 
triethanolamine buffer, and dehydrated through a series of alcohol washes in 70%, 
95%, and 100% alcohol at room temperature. Subsequently, sections are delipidated 
in chloroform, followed by rehydration through successive exposure to 100% and 
95% alcohol at room temperature. Microscope slides containing processed 
cryosections are allowed to air dry prior to hybridization. Other tissues may be 
assayed in a similar fashion. 

A nGPCR-x-specific probe is generated using PCR. Following PCR 
amplification, the fragment is digested with restriction enzymes and cloned into 
pBluescript II cleaved with the same enzymes. For production of a probe specific for 
the sense strand of nGPCR-x, the nGPCR-x clone in pBluescript II is linearized with a 
suitable restriction enzyme, which provides a substrate for labeled run-off transcripts 
(i.e., cRN A riboprobes) using the vector-borne T7 promoter and commercially 
available T7 RNA polymerase. A probe specific for the antisense strand of nGPCR-x 
is also readily prepared using the nGPCR-x clone in pBluescript II by cleaving the 
recombinant plasmid with a suitable restriction enzyme to generate a linearized 
substrate for the production of labeled run-off c RNA transcripts using the T3 
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promoter and cognate polymerase. The riboprobes are labeled with [ 35 S]-UTP to 
yield a specific activity of about 0.40 x 10 6 cpm/pmol for antisense riboprobes and 
about 0.65 x 10 6 cpm/pmol for sense-strand riboprobes. Each riboprobe is 
subsequently denatured and added (2 pmol/ml) to hybridization buffer which 
contained 50% forrnamide, 10% dextran, 0.3 M NaCl, 10 mM Tris (pH 8.0), 1 mM 
EDTA, IX Denhardt's Solution, and 10 mM dithiothreitol. Microscope slides 
containing sequential brain cryosections are independently exposed to 45 fil of 
hybridization solution per slide and silanized cover slips are placed over the sections 
being exposed to hybridization solution. Sections are incubated overnight (15-18 
hours) at 52°C to allow hybridization to occur. Equivalent series of cryosections are 
exposed to sense or antisense nGPCR-40-specific cRNA riboprobes. 

Following the hybridization period, coverslips are washed off the slides in IX 
SSC, followed by RNase A treatment involving the exposure of slides to 20 /Ag/rnl 
RNase A in a buffer containing 10 mM Tris-HCl (pH 7.4), 0.5 M EDTA, and 0.5 M 
NaCl for 45 minutes at 37°C. The cryosections are then subjected to three high- 
stringency washes in 0.1 X SSC at 52°C for 20 minutes each. Following the series of 
washes, cryosections are dehydrated by consecutive exposure to 70%, 95%, and 100% 
ammonium acetate in alcohol, followed by air drying and exposure to Kodak 
BioMax™ MR-1 film. After 13 days of exposure, the film is developed. Based on 
these results, slides containing tissue that hybridized, as shown by film 
auto radio grams, are coated with Kodak NTB-2 nuclear track emulsion and the slides 
are stored in the dark for 32 days. The slides are then developed and counterstained 
with hematoxylin. Emulsion-coated sections are analyzed microscopically to 
determine the specificity of labeling. The signal is determined to be specific if 
autoradiographic grains (generated by antisense probe hybridization) are clearly 
associated with cresyl violate-stained cell bodies. Autoradiographic grains found 
between cell bodies indicates non-specific binding of the probe. 

Expression of nGPCR-x in the brain provides an indication that modulators of 
nGPCR-x activity have utility for treating neurological disorders, including but not 
limited to, schizophrenia, affective disorders, ADHD/ADD (i.e., Attention Deficit- 
Hyperactivity Disorder/ Attention Deficit Disorder), and neural disorders such as 
Alzheimer's disease, Parkinson's disease, migraine, and senile dementia. Some other 
diseases for which modulators of nGPCR-x may have utility include depression, 
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anxiety, bipolar disease, epilepsy, neuritis, neurasthenia, neuropathy, neuroses, and 
the like. Use of nGPCR-x modulators, including nGPCR-x ligands and anti-nGPCR- 
x antibodies, to treat individuals having such disease states is intended as an aspect of 
the invention. 

EXAMPLE 4: TISSUE EXPRESSION PROFILING 

Tissue specific expression of the cDNAs encoding nGPCR-1, nGPCR-3, 
nGPCR-9, nGPCR-1 1, nGPCR-1 6, nGPCR-40, nGPCR-54, nGPCR-56, and nGPCR- 
58 was detected using a PCR-based system. Tissue specific expression of cDNAs 
encoding nGPCR-x may be accomplished using similar methods. 

Primers were synthesized by Genosys Corp., The Woodlands, TX. PCR 
reactions were assembled using the components of the Expand Hi-Fi PCR System™ 
(Roche Molecular Biochemicals, Indianapolis, IN). 
nGPCR-1 

The RapidScan™ Gene Expression Panel was used to generate a 
comprehensive expression profile of the putative GPCR in human tissues. Human 
tissues in the array may include: brain, heart, kidney, spleen, liver, colon, lung, small 
intestine, muscle, stomach, testis, placenta, salivary gland, thyroid, adrenal gland, 
pancreas, ovary, uterus, prostate, skin, PBL, bone marrow, fetal brain, fetal liver. 
Human brain regions in the array may include: frontal lobe, temporal lobe, 
cerebellum, hippocampus, substantia nigra, caudate nucleus, amygdala, thalamus, 
hypothalamus, pons, medulla and spinal cord. 

Expression of the nGPCR-1 in the various tissues was detected by using PCR 
primers designed based on the available sequence of the receptor that will prime the 
synthesis of a 212bp fragment in the presence of the appropriate cDNA. The forward 
primer was: 

GCTCAACCCACTCATCTATGCC (SEQ ID NO: 97), and the reverse primer 

was: 

AAACTTCTCTGCCCTTACCGTC (SEQ ID NO: 98) 
The PCR reaction mixture was added to each well of the PCR plate. The plate was 
placed in a GeneAmp PCR9700 PCR thermocycler (Perkin Elmer Applied 
Biosystems). The plate was then exposed to the following cycling parameters: Pre- 
soak 94°C for 3 min; denaturation at 94°C for 30 seconds; annealing at primer T m for 
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45 seconds; extension 72°C for 2 minutes; for 35 cycles. PCR products were then 
separated and analyzed by electrophoresis on a 1 .5-% agarose gel. 

The 4-log dilution range of cDNA deposited on the plate ensured that the 
amplification reaction is within the linear range and, hence, facilitated the semi- 
quantitative determination of relative mRNA accumulation in the various tissues or 
brain regions examined. 

Expression of nGPCR-1 was found to be highest in the testis, adrenal gland 
and heart. Significant levels of expression were also found in the brain, kidney, 
spleen ovary, prostate, muscle, PBL, stomach and bone marrow. Within the brain, 
expression levels were highest in the cerebellum, amygdala, thalamus and spinal cord, 
with significant levels of expression in the frontal lobe, hippocampus, substantia 
nigra, hypothalamus and pons. 

Expression of nGPCR-1 in the brain provided an indication that modulators of 
nGPCR-1 activity have utility for treating neurological disorders, including but not 
limited to, schizophrenia, affective disorders, ADHD/ADD {i.e., Attention Deficit- 
Hyperactivity Disorder/ Attention Deficit Disorder), and neural disorders such as 
Alzheimer's disease, Parkinson's disease, migraine, and senile dementia. Some other 
diseases for which modulators of nGPCR-1 may have utility include depression, 
anxiety, bipolar disease, epilepsy, neuritis, neurasthenia, neuropathy, neuroses, and 
the like. Use of nGPCR-1 modulators, including nGPCR-1 ligands and anti-nGPCR- 
1 antibodies, to treat individuals having such disease states is intended as an aspect of 
the invention. 
nGPCR-3 

Tissue specific expression of the cDNA encoding nGPCR-3 was detected 
using a PCR-based method. Multiple Choice™ first strand cDNAs (OriGene 
Technologies, Rockville, MD) from 6 human tissues were serially diluted over a 3-log 
range and arrayed into a multi-well PCR plate. This array was used to generate a 
comprehensive expression profile of the putative GPCR in human tissues. Human 
tissues arrayed included: brain, heart, kidney, peripheral blood leukocytes, lung and 
testis. PCR primers were designed based on the available sequence of the putative 
GPCR. The sequence of the forward primer used was: 

5 'TGCTGCTTTGTTGCGCCTAC3 ' (SEQ ID NO: 189), corresponding to 
base pairs 77 through 96 of the predicted coding sequence of nGPCR-3. The 
sequence of the reverse pnmer used was: 
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5 'TTGGACGCC AGG AAGGTG3 * (SEQ ID NO: 190), corresponding to base 
pairs 258 through 285 of the predicted coding sequence of nGPCR-3. This primer set 
primes the synthesis of a 298 base pair fragment in the presence of the appropriate 
cDNA. For detection of expression within brain regions, the same primer set was 
5 used with the Human Brain Rapid Scan™ Panel (OriGene Technologies, Rockville, 
MD) This panel represents serial dilutions over a 3 log range of first strand cDNA 
from the following brain regions arrayed in a 96 well format: frontal lobe, temporal 
lobe, cerebellum, hippocampus, substantia nigra, caudate nucleus, amygdala, 
thalamus, hypothalamus, pons, medulla and spinal cord. Primers were synthesized by 

10 Genosys Corp., The Woodlands, TX. PCR reactions were assembled using the 

components of the Expand Hi-Fi PCR System™ (Roche Molecular Biochemicals, 
Indianapolis, IN). Twenty-five microliters of the PCR reaction mixture was added to 
each well of the RapidScan PCR plate. The plate was placed in a GeneAmp 9700 
PCR thermocycler (Perkin Elmer Applied Biosystems). The following cycling 

15 program was executed: Pre-soak at (94°C for 3min.) followed by 35 cycles of [(94°C 
for 45 sec), (53°C for 2 min.), and (72°C for 45 sec.)]. PCR reaction products were 
then separated and analyzed by electrophoresis on a 2.0% agarose gel stained with 
ethidium bromide. 

The results indicated that nGPCR-3 was expressed in the brain, heart, kidney, 
20 peripheral blood lymphocytes, lung, and testis. In the brain, nGPCR-3 was expressed 
in frontal lobe, temporal lobe, cerebellum, hippocampus, substantia nigra, caudate 
nucleus, amygdala, thalamus, hypothalamus, pons, medulla, as well as in the spinal 
cord. 

nGPCR-9 

25 The RapidScan™ Gene Expression Panel was used to generate a 

comprehensive expression profile of the putative GPCR in human tissues. Human 
tissues arrayed include: brain, heart, kidney, spleen, liver, colon, lung, small intestine, 
muscle, stomach, testis, placenta, salivary gland, thyroid, adrenal gland, pancreas, 
ovary, uterus, prostate, skin, PBL, bone marrow, fetal brain, fetal liver. 

30 The forward primer used was to detect expression of nGPCR-9 was: 

5' AACCCCATCATCTACACGC 3 '(SEQ ID NO: 105), and, the reverse 
primer was: 
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5' TGCCTGTGGAGCCGCTGG 3'(SEQ ID NO: 106). This primer set will 
prime the synthesis of a 238 base pair fragment in the presence of the appropriate 



For detection of expression within brain regions, the same primer set was used 
with the Human Brain Rapid Scan IM Panel (OriGene Technologies, Rockville, MD). 
This panel represents serial dilutions over a 2-log range of first strand cDNA from the 
following brain regions arrayed in a 96 well format: frontal lobe, temporal lobe, 
cerebellum, hippocampus, substantia nigra, caudate nucleus, amygdala, thalamus, 
hypothalamus, pons, medulla and spinal cord. 

Twenty-five microliters of the PCR reaction mixture was added to each well 
of the PCR plate. The plate was placed in a GeneAmp 9700 PCR thermocycler 
(Perkin Elmer Applied Biosystems). The following cycling program was executed: 
Pre-soak at (94°Cfor 3 min.) followed by 35 cycles of [(94°Cfor 45 sec.) (52°C for 2 
min.) (72°Cfor 45 sec.)]. PCR reaction products were then separated and analyzed by 
electrophoresis on a 2.0% agarose gel and stained with ethidium bromide. 

nGPCR-9 was expressed in the brain, peripheral blood leukocytes, heart, 
kidney, adrenal gland, spleen, pancreas, liver, lung, skin, bone marrow, testis, 
placenta, salivary gland, uterus, small intestine, muscle, stomach, and fetal liver. 
Within the brain, nGPCR-9 was expressed in all areas examined including the frontal 
lobe, temporal lobe, cerebellum, hippocampus, substantia nigra, caudate nucleus, 
amygdala, thalamus, hypothalamus, pons, medulla and spinal cord. 

Expression of nGPCR-9 in the brain provided an indication that modulators of 
nGPCR-9 activity have utility for treating disorders, including but not limited to, 
schizophrenia, affective disorders, movement disorders, metabolic disorders, 
inflammatory disorders, cancers, ADHD/ADD (i.e., Attention Deficit-Hyperactivity 
Disorder/Attention Deficit Disorder), and neural disorders such as Alzheimer's 
disease, Parkinson's disease, migraine, and senile dementia. Use of nGPCR-9 
modulators, including nGPCR-9 ligands and anti-nGPCR-9 antibodies, to treat 
individuals having such disease states is intended as an aspect of the invention. 
nGPCR-11 

The RapidScan™ Gene Expression Panel was used to generate a 
comprehensive expression profile of the putative GPCR in human tissues. Human 
tissues in the array included, inter alia: brain, heart, kidney, spleen, liver, colon, lung, 
small intestine, muscle, stomach, testis, placenta, salivary gland, thyroid, adrenal 
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gland, pancreas, ovary, uterus, prostate, skin, PBL, bone marrow, fetal brain, fetal 
liver. Human brain regions in the array included, inter alia: frontal lobe, temporal 
lobe, cerebellum, hippocampus, substantia nigra, caudate nucleus, amygdala, 
thalamus, hypothalamus, pons, medulla and spinal cord. 
5 Expression of nGPCR-1 1 in the various tissues was detected by using PCR 

primers designed based on the available sequence of the receptor that will prime the 
synthesis of a 206bp fragment in the presence of the appropriate cDNA. The forward 
primer used to detect expression of nGPCR-1 1 was: 

S'-GAAGCCCAGCACTGTTTACC^* (SEQ ID NO: 109), and the reverse 

10 primer was: 

5 *-TG AAATACCTGTCCGC AGCC-3 (SEQ ID NO: 110). 
Twenty-five microliters of the PCR reaction mixture was added to each well of the 
RapidScan PCR plate. The plate was placed in a GeneAmp 9700 PCR thermocycler 
(PE Applied Biosystems). The following cycling program was executed: Pre-soak 

15 94°C for 3 min; denaturation at 94°C for 30 seconds; annealing at primer T m for 45 
seconds; extension at 72°C for 2 minutes; for 35 cycles. PCR reaction products were 
then separated and analyzed by electrophoresis on a 2.0% agarose gel stained with 
ethidium bromide. 

The 4-log dilution range of cDNA deposited on the plate ensured that the 

20 amplification reaction was within the linear range and, facilitated semi-quantitative 
determination of relative mRNA accumulation in the various tissues or brain regions 
examined. 

nGPCR-1 1 was expressed in the thyroid gland, brain, heart, kidney, adrenal 
gland, spleen, liver, ovary, muscle, testis, salivary gland, colon, prostate, small 

25 intestine, skin stomach, bone marrow, fetal brain and placenta. Within the brain, 
nGPCR-1 1 was expressed in the temporal lobe, amygdala, substantia nigra, pons, 
spinal cord, frontal lobe, and cerebellum. 

Expression of the nGPCR-1 1 in the brain provided an indication that 
modulators of nGPCR-1 1 activity have utility for treating disorders, including but not 

30 limited to, schizophrenia, affective disorders, metabolic disorders, inflammatory 
disorders, cancers, ADHD/ ADD (i.e., Attention Deficit-Hyperactivity 
Disorder/Attention Deficit Disorder), and neural disorders such as Alzheimer's 
disease, Parkinson's disease, migraine, and senile dementia. Some other diseases for 
which modulators of nGPCR-1 1 may have utility include depression, anxiety, bipolar 
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disease, epilepsy, neuritis, neurasthenia, neuropathy, neuroses, and the like. Use of 
nGPCR-1 1 modulators, including nGPCR-1 1 ligands and anti-nGPCR- 1 1 antibodies, 
to treat individuals having such disease states is intended as an aspect of the 
invention. 

Expression of nGPCR-1 1 in the thyroid gland, indicates that agonists or 
antagonists could be of use in the treatment of thyroid dysfunction such as 
thyreotoxicosis and myxoedema. They could also be of use in the stimulation of 
thyroid hormone release leading to overall increase in metabolic rate and weight 
reduction. The expression of nGPCR-1 1 in liver and muscle indicate a use for 
agonists or antagonists in regulation of glucose metabolism applicable in diabetes 
type II. 
nGCPR-16 

The RapidScan™ Gene Expression Panel was used to generate a 
comprehensive expression profile of the putative GPCR in human tissues. Human 
tissues in the array included, inter alia: brain, heart, kidney, spleen, liver, colon, lung, 
small intestine, muscle, stomach, testis, placenta, salivary gland, thyroid, adrenal 
gland, pancreas, ovary, uterus, prostate, skin, PBL, bone marrow, fetal brain, fetal 
liver. Human brain regions in the array included, inter alia: frontal lobe, temporal 
lobe, cerebellum, hippocampus, substantia nigra, caudate nucleus, amygdala, 
thalamus, hypothalamus, pons, medulla and spinal cord. 

Expression of nGPCR-1 6 in the various tissues was detected by using PCR 
primers designed based on the available sequence of the receptor that will prime the 
synthesis of a 205bp fragment in the presence of the appropriate cDNA. The forward 
primer used to detect expression of nGPCR-1 6 was: 

5' CAGCCCAAACATCCAAGTC 3\ (SEQ ID NO: 113). The reverse 
primer used to detect expression of nGPCR-1 6 was: 

5 > ACCCCACTTAATCAGCCTC 3\SEQ ID NO: 1 14). 

For detection of expression within brain regions, the same primer set was used 
with the Human Brain Rapid Scan IM Panel (OriGene Technologies, Rockville, MD). 
This panel represents serial dilutions over a 2 log range of first strand cDNA from the 
following brain regions arrayed in a 96 well format: frontal lobe, temporal lobe, 
cerebellum, hippocampus, substantia nigra, caudate nucleus, amygdala, thalamus, 
hypothalamus, pons, medulla and spinal cord. 
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Twenty-five microliters of the PCR reaction mixture was added to each well 
of the RapidScan PCR plate. The plate was placed in a GeneAmp 9700 PCR 
thermocycler (Perkin Elmer Applied Biosystems). The following cycling program 
was executed: Pre-soak at (94° for 3min.) followed by 35 cycles of [(94°C for 45 sec.) 
5 (53°C for 2 min.) (72°C for 45 sec.)]. PCR reaction products were then separated and 
analyzed by electrophoresis on a 2.0% agarose gel, and stained with ethidium 
bromide. 

The 4-Iog dilution range of cDNA deposited on the plate ensured that the 
amplification reaction was within the linear range and, facilitated semi -quantitative 
10 determination of relative mRNA accumulation in the various tissues or brain regions 
examined. 

nGPCR-16 was expressed in the ovary, lung, prostate, bone marrow, salivary 
gland, heart, adrenal gland, spleen, liver, small intestine, skin, muscle, peripheral 
blood leukocytes, testis, placenta, fetal liver, brain, thyroid gland, kidney, pancreas, 
15 colon, uterus, and stomach.. Within the brain, nGPCR-16 was expressed in all areas 
examined including the frontal lobe, temporal lobe, cerebellum, hippocampus, 
substantia nigra, caudate nucleus, amygdala, thalamus, hypothalamus, pons, medulla 
and spinal cord. 

Expression of nGPCR-16 in the brain provides an indication that modulators 
20 of nGPCR-16 activity have utility for treating neurological disorders, including but 
not limited to, schizophrenia, affective disorders, ADHD/ ADD (i.e.. Attention 
Deficit-Hyperactivity Disorder/Attention Deficit Disorder), and neural disorders such 
as Alzheimer's disease, Parkinson's disease, migraine, and senile dementia. Some 
other diseases for which modulators of nGPCR-16 may have utility include 
25 depression, anxiety, bipolar disease, epilepsy, neuritis, neurasthenia, neuropathy, 

neuroses, and the like. Use of nGPCR-16 modulators, including nGPCR-16 ligands 
and anti-nGPCR-16 antibodies, to treat individuals having such disease states is 
intended as an aspect of the invention. 
rtGPCR-40 

30 The RapidScan™ Gene Expression Panel (OriGene Technologies, Rockville, 

MD) was used to generate a comprehensive expression profile of the putative GPCR 
in human tissues. Human tissues arrayed include: brain, heart, kidney, spleen, liver, 
colon, lung, small intestine, muscle, stomach, testis, placenta, salivary gland, thyroid, 
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adrenal gland, pancreas, ovary, uterus, prostate, skin, PBL, bone marrow, fetal brain, 
fetal liver. The forward primer used was: 

5 ' ACAGCCCCAAAGCCAAAC AC3 ' , (SEQ ID NO: 1 1 7), and the reverse 
primer was: 

5 'CCGCAGGAGC AATGAAAATCAG3 \ (SEQ ID NO: 118). This primer 
set primed the synthesis of a 220 base pair fragment in the presence of the appropriate 
cDNA. For detection of expression within brain regions, the same primer set was 
used with the Human Brain RapidScan™ Panel (OriGene Technologies, Rockville, 
MD). This panel represents serial dilutions over a 2 log range of first strand cDNA 
from the following brain regions arrayed in a 96 well format: frontal lobe, temporal 
lobe, cerebellum, hippocampus, substantia nigra, caudate nucleus, amygdala, 
thalamus, hypothalamus, pons, medulla and spinal cord. 

Twenty- five microliters of the PCR reaction mixture was added to each well 
of the RapidScan PCR plate. The plate was placed in a GeneAmp 9700 PCR 
thermocycler (Perkin Elmer Applied Biosystems). The following cycling program 
was executed: Pre-soak at (94°C for 3min.) followed by 35 cycles of [(94° for 45 sec.) 
(54°C for 2 min.) (72° for 45 sec.)]. PCR reaction products were then separated and 
analyzed by electrophoresis on a 2.0% agarose gel stained with ethidium bromide. 

The dilution range of cDNA deposited on the plates ensured that the 
amplification reaction was within the linear range and, hence, facilitated semi- 
quantitative determination of relative mRNA accumulation in the various tissues or 
brain regions examined. 

nGPCR-40 was expressed in the brain, peripheral blood lymphocytes, 
pancreas, ovary, uterus, testis, salivary gland, kidney, adrenal gland, liver, bone 
25 marrow, prostate, fetal liver, colon, muscle, and fetal brain, may be found in many 
other tissues, including, but not limited to, lung, small intestine, fetal brain cord, and 
bone. Within the brain, nGPCR-40 was expressed in the frontal lobe, hypothalamus, 
pons, cerebellum, caudate nucleus, and medulla. 

Expression of nGPCR-40 in the brain provides an indication that modulators 
of nGPCR-40 activity have utility for treating neurological disorders, including but 
not limited to, movement disorders, affective disorders, metabolic disorders, 
inflammatory disorders and cancers. Use of nGPCR-40 modulators, including 
nGPCR-40 ligands and anti-nGPCR-40 antibodies, to treat individuals having such 
disease states is intended as an aspect of the invention. 



20 



30 



120 



WO 01/36473 



PCT/US00/3I581 



dGPCR-54 

Multiple Choice™ first strand cDNAs (OriGene Technologies, Rockville, 
MD) from 12 human tissues were serially diluted over a 3-log range and arrayed into 
a multi-well PCR plate. Human tissues arrayed include: brain, heart, kidney, 
5 peripheral blood leukocytes, liver, lung, muscle, ovary, prostate, small intestine, 
spleen and testis. PCR primers were designed based on the sequence of nGPCR-54 
provided herein. The forward primer used was: 

5'CTGTCTCTCTGTCCTCTTCC3\(SEQ ID NO: 123). The reverse primer 
used was: 

10 5 'GC ACCG ATCTTC ATTG AATTTC3 ',(SEQ ID NO: 124). This primer set 

primes the synthesis of a 145 base pair fragment in the presence of the appropriate 
cDNA. For detection of expression within brain regions, the same primer set was 
used with the Human Brain Rapid Scan™ Panel (OriGene Technologies, Rockville, 
MD). This panel represents serial dilutions over a 3 log range of first strand cDNA 

15 from the following brain regions arrayed in a 96 well format: frontal lobe, temporal 
lobe, cerebellum, hippocampus, substantia nigra, caudate nucleus, amygdala, 
thalamus, hypothalamus, pons, medulla and spinal cord. 

Twenty-five microliters of the PCR reaction mixture was added to each well 
of the RapidScan PCR plate. The plate was placedin a GeneAmp 9700 PCR 

20 thermocycler (Perkin Elmer Applied Biosystems). The following cycling program 
was executed: Pre-soak at (94°C for 3min.) followed by 35 cycles of [(94°C for 45 
sec.) (52.5°C for 2 min.) (72°C for 45 sec.)]. PCR reaction products were then 
separated and analyzed by electrophoresis on a 2.0% agarose gel stained with 
ethidium bromide. 

25 nGPCR-54 was expressed in the brain, kidney, lung, muscle, testis, heart, 

liver, ovary, prostate, small intestine, spleen, and peripheral blood leukocytes. Within 
the brain, nGPCR-54 was expressed in the cerebellum, hippocampus, substantia nigra, 
thalamus, hypothalamus, pons, frontal lobe, temporal lobe, caudate nucleus, medulla, 
spinal cord, and amygdala. 

30 Expression of the nGPCR-54 in the brain provides an indication that 

modulators of nGPCR-54 activity have utility for treating neurological disorders, 
including but not limited to, movement disorders, affective disorders, metabolic 
disorders, inflammatory disorders and cancers. Use of nGPCR-54 modulators, 
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including nGPCR-54 ligands and anti-nGPCR-54 antibodies, to treat individuals 
having such disease states is intended as an aspect of the invention. 
nGPCR-56 

The RapidScan rM Gene Expression Panel was used to generate a 
comprehensive expression profile of the putative GPCR in human tissues. Human 
tissues arrayed include: brain, heart, kidney, spleen, liver, colon, lung, small intestine, 
muscle, stomach, testis, placenta, salivary gland, thyroid, adrenal gland, pancreas, 
ovary, uterus, prostate, skin, PBL, bone marrow, fetal brain, fetal liver. The forward 
primer used was: 

5' ACTTCAAACAACTTCATACCCC 3' (SEQ ID NO: 125), and the reverse 
primer used was: 

5 * ACACACAGCATAGTAGCG 3* (SEQ ID NO: 126). This primer set will 
prime the synthesis of a 231 base pair fragment in the presence of the appropriate 
cDNA. For detection of expression within brain regions, the same primer set was 
used with the Human Brain Rapid Scan™ Panel (OriGene Technologies, Rockville, 
MD). This panel represents serial dilutions over a 2 log range of first strand cDNA 
from the following brain regions arrayed in a 96 well format: frontal lobe, temporal 
lobe, cerebellum, hippocampus, substantia nigra, caudate nucleus, amygdala, 
thalamus, hypothalamus, pons, medulla and spinal cord. 

Twenty-five microliters of the PCR reaction mixture was added to each well 
of the RapidScan PCR plate. The plate was placed in a GeneAmp 9700 PCR 
thermocycler (Perkin Elmer Applied Biosystems). The following cycling program 
was executed: Pre-soak at (94°C for 3min.) followed by 35 cycles of [(94°C for 45 
sec.) (53°C for 2 min.) (72°C for 45 sec.)]. PCR reaction products were then 
separated and analyzed by electrophoresis on a 2.0% agarose gel stained with 
ethidium bromide. 

nGPCR-56 was expressed in peripheral blood lymphocytes, testis, salivary 
gland, kidney, spleen, skin, stomach, placenta, ovary, bone marrow, fetal liver, small 
intestine, and fetal brain. 

Expression of nGPCR-56 in the brain provides an indication that modulators 
of nGPCR-56 activity have utility for treating neurological disorders, including but 
not limited to, movement disorders, affective disorders, metabolic disorders, 
inflammatory disorders and cancers. Use of nGPCR-56 modulators, including 
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nGPCR-56 ligands and anti-nGPCR-56 antibodies, to treat individuals having such 

disease states is intended as an aspect of the invention. 

nGPCR-58 

The RapidScan™ Gene Expression Panel was used to generate a 
5 comprehensive expression profile of the putative GPCR in human tissues. Human 
tissues in the array included: brain, heart, kidney, spleen, liver, lung, small intestine, 
muscle, testis, ovary, prostate, and PBL. Human brain regions in the array included: 
frontal lobe, temporal lobe, cerebellum, hippocampus, substantia nigra, caudate 
nucleus, amygdala, thalamus, hypothalamus, pons, medulla and spinal cord. 
10 Expression of the nGPCR-58 in the various tissues was detected by using PCR 

primers designed based on the available sequence of the receptor that will prime the 
synthesis of a 282bp fragment in the presence of the appropriate cDNA. The forward 
primer was: 

CAGAGCTTGATGATGAGGAC (SEQ ID NO: 127), and the reverse 

15 primer was: 

CCCATAGGAAGTAGTAGAAG (SEQ ID NO: 128). 
The PCR reaction mixture was added to each well of the PCR plate. The plate 
was placed in a GeneAmp PCR9700 PCR thermocycler (Perkin Elmer Applied 
Biosystems). The plate was then exposed to the following cycling parameters: Pre- 
20 soak 94° for 3 min; denaturation at 94° for 30 seconds; annealing at primer T m for 45 
seconds; extension at 72° for 2 minutes; for 35 cycles. PCR productions were then 
separated and analyzed by electrophoresis on a 1 .5-% agarose gel. 

The 4-log dilution range of cDNA deposited on the plate ensured that the 
amplification reaction was within the linear range and, hence, facilitated semi- 
25 quantitative determination of relative mRNA accumulation in the various tissues or 
brain regions examined. 

nGPCR-58 was expressed in all tissues included on the array, including brain, 
muscle, prostate, kidney, peripheral blood lymphocytes, liver, lung, small intestine, 
spleen, testis, heart, and ovary. Within the brain, nGPCR-58 was expressed in many 
30 regions including, but not limited to cerebellum, substantia nigra, thalamus, pons, 
spinal cord, frontal lobe, temporal lobe, hippocampus, caudate nucleus, amygdala, 
hypothalamus, and medulla. 

Expression of the nGPCR-58 in the brain provided an indication that 
modulators of nGPCR-58 activity have utility for treating disorders, including but not 
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limited to, schizophrenia, affective disorders, ADHD/ ADD (i.e., Attention Deficit- 
Hyperactivity Disorder/ Attention Deficit Disorder), neural disorders such as 
Alzheimer's disease, Parkinson's disease, migraine, senile dementia, depression, 
anxiety, bipolar disease, epilepsy, neuritis, neurasthenia, neuropathy, neuroses, 
5 metabolic disorders, inflammatory disorders, cancers and the like. Use of nGPCR-58 
modulators, including nGPCR-58 ligands and anti-nGPCR-58 antibodies, to treat 
individuals having such disease states is intended as an aspect of the invention. 



EXAMPLE 5: NORTHERN BLOT ANALYSIS 

10 Northern blots are performed to examine the expression of nGPCR-x mRNA. 

The sense orientation oligonucleotide and the antisense-orientation oligonucleotide, 
described above, are used as primers to amplify a portion of the GPCR-x cDNA 
sequence of an odd numbered nucleotide sequence ranging from SEQ ID NO: 1 to 
SEQ ID NO: 93 and SEQ ID NO: 1 85. 

15 Multiple human tissue northern blots from Clontech (Human II # 7767-1) are 

hybridized with the probe. Pre-hybridization is carried out at 42 C for 4 hours in 
5xSSC, IX Denhardt's reagent, 0.1% SDS, 50% formamide, 250 mg/ml salmon 
sperm DNA. Hybridization is performed overnight at 42°C in the same mixture with 
the addition of about 1.5xl0 6 cpm/ml of labeled probe. 

20 The probe is labeled with a- 32 P-dCTP by Rediprime™ DNA labeling system 

(Amersham Pharmacia), purified on Nick Column™ (Amersham Pharmacia) and 
added to the hybridization solution. The filters are washed several times at 42 C in 
0.2x SSC, 0.1%) SDS. Filters are exposed to Kodak XAR film (Eastman Kodak 
Company, Rochester, N.Y., USA) with intensifying screen at -80°C. 

25 

EXAMPLE 6: RECOMBINANT EXPRESSION OF nGPCR-X IN 
EUKARYOTIC HOST CELLS 

A. Expression of nGPCR-x in Mammalian Cells 

To produce nGPCR-x protein, a nGPCR-x-encoding polynucleotide is 
30 expressed in a suitable host cell using a suitable expression vector and standard 
genetic engineering techniques. For example, the nGPCR-x-encoding sequence 
described in Example 1 is subcloned into the commercial expression vector pzeoSV2 
(Invitrogen, San Diego, CA) and transfected into Chinese Hamster Ovary (CHO) cells 
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using the transfection reagent FuGENE6™ (Boehnnger-Mannheim) and the 
transfection protocol provided in the product insert. Other eukaryotic cell lines, 
including human embryonic kidney (HEK 293) and COS cells, are suitable as well. 
Cells stably expressing nGPCR-x are selected by growth in the presence of 100 Mg/ml 
5 zeocin (Stratagene, LaJolla, CA). Optionally, nGPCR-x may be purified from the 

cells using standard chromatographic techniques. To facilitate purification, antisera is 
raised against one or more synthetic peptide sequences that correspond to portions of 
the nGPCR-x amino acid sequence, and the antisera is used to affinity purify nGPCR- 
x. The nGPCR-x also may be expressed in-frame with a tag sequence (e.g., 
10 polyhistidine, hemagluttinin, FLAG) to facilitate purification. Moreover, it will be 
appreciated that many of the uses for nGPCR-x polypeptides, such as assays 
described below, do not require purification of nGPCR-x from the host cell. 



B. Expression of nGPCR-x in 293 cells 

15 For expression of nGPCR-x in mammalian cells 293 (transformed human, 

primary embryonic kidney cells), a plasmid bearing the relevant nGPCR-x coding 
sequence is prepared, using vector pSecTag2A (Invitrogen). Vector pSecTag2A 
contains the murine IgK chain leader sequence for secretion, the c-myc epitope for 
detection of the recombinant protein with the anti-myc antibody, a C-terminal 

20 polyhistidine for purification with nickel chelate chromatography, and a Zeocin 
resistant gene for selection of stable transfectants. The forward primer for 
amplification of this GPCR cDNA is determined by routine procedures and preferably 
contains a 5' extension of nucleotides to introduce the Hindlll cloning site and 
nucleotides matching the GPCR sequence. The reverse primer is also determined by 

25 routine procedures and preferably contains a 5' extension of nucleotides to introduce 
an Xhol restriction site for cloning and nucleotides corresponding to the reverse 
complement of the nGPCR-x sequence. The PCR conditions are 55°C as the 
annealing temperature. The PCR product is gel purified and cloned into the Hindlll- 
Xhol sites of the vector. 

30 The DNA is purified using Qiagen chromatography columns and transfected 

into 293 cells using DOTAP™ transfection media (Boehringer Mannheim, 
Indianapolis, IN). Transiently transfected cells are tested for expression after 24 
hours of transfection, using western blots probed with anti-His and anti-nGPCR-x 
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peptide antibodies. Permanently transfected cells are selected with Zeocin and 
propagated. Production of the recombinant protein is detected from both cells and 
media by western blots probed with anti-His, anti-Myc or anti-GPCR peptide 
antibodies. 

C. Expression of nGPCR-x in COS cells 

For expression of the nGPCR-x in COS7 cells, a polynucleotide molecule 
having an odd numbered nucleotide sequence ranging from SEQ ED NO: 1 to SEQ ID 
NO: 93 and SEQ ID NO: 185 can be cloned into vector p3-CI. This vector is a 
pUCl 8-derived plasmid that contains the HCMV (human cytomegalovirus) promoter- 
intron located upstream from the bGH (bovine growth hormone) polyadenylation 
sequence and a multiple cloning site. In addition, the plasmid contains the dhrf 
(dihydro folate reductase) gene which provides selection in the presence of the drug 
methotrexane (MTX) for selection of stable transform ants. 

The forward primer is determined by routine procedures and preferably 
contains a 5' extension which introduces an Xbal restriction site for cloning, followed 
by nucleotides which correspond to an odd numbered nucleotide sequence ranging 
from SEQ rD NO: 1 to SEQ ID NO: 93 and SEQ ID NO: 185. The reverse primer is 
also determined by routine procedures and preferably contains 5'- extension of 
nucleotides which introduces a Sail cloning site followed by nucleotides which 
correspond to the reverse complement of an odd numbered nucleotide sequence 
ranging from SEQ ID NO: 1 to SEQ ID NO: 93 and SEQ ID NO: 185. The PCR 
consists of an initial denaturation step of 5 min at 95°C 30 cycles of 30 sec 
denaturation at 95°C, 30 sec annealing at 58°C and 30 sec extension at 72°C, 
followed by 5 min extension at 72°C. The PCR product is gel purified and ligated 
into the Xbal and Sail sites of vector p3-CI. This construct is transformed into E. coli 
cells for amplification and DNA purification. The DNA is purified with Qiagen 
chromatography columns and transfected into COS 7 cells using Lipofectamine™ 
reagent from BRL, following the manufacturer's protocols. Forty-eight and 72 hours 
after transfection, the media and the cells are tested for recombinant protein 
expression. 

nGPCR-x expressed from a COS cell culture can be purified by concentrating 
the cell-growth media to about 1 0 mg of protein/ml, and purifying the protein by, for 
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example, chromatography. Purified nGPCR-x is concentrated to 0.5 mg/ml in an 
Amicon concentrator fitted with a YM-10 membrane and stored at -80°C. 

D. Expression of nGPCR-x in Insect Cells 
5 For expression of nGPCR-x in a baculovirus system, a polynucleotide 

molecule having an odd numbered nucleotide sequence ranging from SEQ ED NO: 1 
to SEQ ID NO: 93 and SEQ ID NO: 185 can be amplified by PCR. The forward 
primer is determined by routine procedures and preferably contains a 5' extension 
which adds the Ndel cloning site, followed by nucleotides which correspond to an odd 

10 numbered nucleotide sequence ranging from SEQ ID NO: 1 to SEQ ID NO: 93 and 
SEQ ID NO: 1 85. The reverse primer is also determined by routine procedures and 
preferably contains a 5' extension which introduces the Kpnl cloning site, followed by 
nucleotides which correspond to the reverse complement of an odd numbered 
nucleotide sequence ranging from SEQ ID NO: 1 to SEQ ID NO: 93 and SEQ ID NO: 

15 185. 

The PCR product is gel purified, digested with Ndel and Kpnl, and cloned into 
the corresponding sites of vector pACHTL-A (Pharmingen, San Diego, CA). The 
pAcHTL expression vector contains the strong polyhedrin promoter of the 
Autographa californica nuclear polyhedrosis virus (AcMNPV), and a 6XHis tag 

20 upstream from the multiple cloning site. A protein kinase site for phosphorylation 
and a thrombin site for excision of the recombinant protein precede the multiple 
cloning site is also present. Of course, many other baculovirus vectors could be used 
in place ofpAcHTL-A, such as pAc373, pVL941 andpAchMl. Other suitable 
vectors for the expression of GPCR polypeptides can be used, provided that the vector 

25 construct includes appropriately located signals for transcription, translation, and 

trafficking, such as an in-frame AUG and a signal peptide, as required. Such vectors 
are described in Luckow et at., Virology 1 70:3 1 -39, among others. 

The virus is grown and isolated using standard baculovirus expression 
methods, such as those described in Summers et al. (A Manual of Methods for 

30 Baculovirus Vectors and Insect Cell Culture Procedures, Texas Agricultural 
Experimental Station Bulletin No. 1555 (1987)). 

In a preferred embodiment, pAcHLT-A containing nGPCR-x gene is 
introduced into baculovirus using the "BaculoGold™" transfection kit (Pharmingen, 
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San Diego, CA) using methods established by the manufacturer. Individual virus 
isolates are analyzed for protein production by radiolabeling infected cells with 35 S- 
methionine at 24 hours post infection. Infected cells are harvested at 48 hours post 
infection, and the labeled proteins are visualized by SDS-PAGE. Viruses exhibiting 
5 high expression levels can be isolated and used for scaled up expression. 

For expression of a nGPCR-x polypeptide in a Sf9 cells, a polynucleotide 
molecule having the sequence of an odd numbered nucleotide sequence ranging from 
SEQ ED NO: 1 to SEQ ID NO: 93 and SEQ ID NO: 1 85 can be amplified by PCR 
using the primers and methods described above for baculovirus expression. The 

10 nGPCR-x cDNA is cloned into vector pAcHLT-A (Pharmingen) for expression in Sf9 
insect. The insert is cloned into the Ndel and Kpnl sites, after elimination of an 
internal Ndel she (using the same primers described above for expression in 
baculovirus). DNA is purified with Qiagen chromatography columns and expressed 
in Sf9 cells. Preliminary Western blot experiments from non-purified plaques are 

15 tested for the presence of the recombinant protein of the expected size which reacted 
with the GPCR-specific antibody. These results are confirmed after further 
purification and expression optimization in HiG5 cells. 

EXAMPLE 7: INTERACTION TRAP/TWO-HYBRID SYSTEM 

20 In order to assay for nGPCR-x-interacting proteins, the interaction trap/two- 

hybrid library screening method can be used. This assay was first described in Fields 
el al. y Nature, 1989, 340, 245, which is incorporated herein by reference in its 
entirety. A protocol is published in Current Protocols in Molecular Biology 1999, 
John Wiley & Sons, NY, and Ausubel, F. M. et al. 1992, Short protocols in molecular 

25 biology, Fourth edition, Greene and Wiley-interscience, NY, each of which is 

incorporated herein by reference in its entirety. Kits are available from Clontech, 
Palo Alto, CA (Matchmaker Two-Hybrid System 3). 

A fusion of the nucleotide sequences encoding all or partial nGPCR-x and the 
yeast transcription factor GAL4 DNA-binding domain (DNA-BD) is constructed in an 

30 appropriate plasmid (i.e., pGBKT7) using standard subcloning techniques. Similarly, 
a GAL4 active domain (AD) fusion library is constructed in a second plasmid (i.e., 
pGADT7) from cDNA of potential GPCR-binding proteins (for protocols on forming 
cDNA libraries, see Sambrook et al. L989, Molecular cloning: a laboratory manual, 
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second edition, Cold Spring Harbor Press, Cold Spring Harbor, NY), which is 
incorporated herein by reference in its entirety. The DNA-BD/nGPCR-x fusion 
construct is verified by sequencing, and tested for autonomous reporter gene 
activation and cell toxicity, both of which would prevent a successful two-hybrid 
5 analysis. Similar controls are performed with the AD/library fusion construct to 
ensure expression in host cells and lack of transcriptional activity. Yeast cells are 
transformed (ca. 105 transformants/mg DNA) with both the nGPCR-x and library 
fusion plasmids according to standard procedures (Ausubel et al., 1992, Short 
protocols in molecular biology, fourth edition, Greene and Wiley-interscience, NY, 

10 which is incorporated herein by reference in its entirety). In vivo binding of DNA- 
BD/nGPCR-x with AD/library proteins results in transcription of specific yeast 
plasmid reporter genes (i.e., lacZ, H1S3, ADE2, LEU2). Yeast cells are plated on 
nutrient-deficient media to screen for expression of reporter genes. Colonies are 
dually assayed for p-galactosidase activity upon growth in Xgal (5-bromo-4-chloro-3- 

15 indolyl-p-D-galactoside) supplemented media (filter assay for p-galactosidase activity 
is described in Breeden et al., Cold Spring Harb. Symp. Quant. Biol., 1985, 50, 643, 
which is incorporated herein by reference in its entirety). Positive AD-library 
plasmids are rescued from transfoimants and reintroduced into the original yeast 
strain as well as other strains containing unrelated DNA-BD fusion proteins to 

20 confirm specific nGPCR-x/library protein interactions. Insert DNA is sequenced to 
verify the presence of an open reading frame fused to GAL4 AD and to determine the 
identity of the nGPCR-x-binding protein. 

EXAMPLE 8: MOBILITY SHIFT DNA-BINDING ASSAY USING GEL 
25 ELECTROPHORESIS 

A gel electrophoresis mobility shift assay can rapidly detect specific protein- 
DNA interactions. Protocols are widely available in such manuals as Sambrook et al. 
1989, Molecular cloning: a laboratory manual, second edition, Cold Spring Harbor 
Press, Cold Spring Harbor, NY and Ausubel, F. M. et al., 1992, Short Protocols in 
30 Molecular Biology, fourth edition, Greene and Wiley-interscience, NY, each of which 
is incorporated herein by reference in its entirety. 

Probe DNA(<300 bp) is obtained from synthetic oligonucleotides, restriction 
endonuclease fragments, or PCR fragments and end-labeled with 32 P. An aliquot of 
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purified nGPCR-x (ca. 1 5 ug) or crude nGPCR-x extract (ca. 15 ng) is incubated at 
constant temperature (in the range 22-37 C) for at least 30 minutes in 10-15 ul of 
buffer (i.e. TAE or TBE, pH 8.0-8.5) containing radiolabeled probe DNA, nonspecific 
carrier DNA (ca. 1 ug), BSA (300 ug/ml), and 10% (v/v) glycerol. The reaction 
5 mixture is then loaded onto a polyacrylamide gel and run at 30-35 mA until good 
separation of free probe DNA from protein-DNA complexes occurs. The gel is then 
dried and bands corresponding to free DNA and protein-DNA complexes are detected 
by autoradiography. 



1 0 EXAMPLE 9: ANTIBODIES TO nGPCR-X 

Standard techniques are employed to generate polyclonal or monoclonal 
antibodies to the nGPCR-x receptor, and to generate useful antigen-binding fragments 
thereof or variants thereof, including "humanized" variants. Such protocols can be 
found, for example, in Sambrook et al. (1989) and Harlow et ai (Eds.), Antibodies A 

15 Laboratory Manual ; Cold Spring Harbor Laboratory; Cold Spring Harbor, NY (1988). 
In one embodiment, recombinant nGPCR-x polypeptides (or cells or cell membranes 
containing such polypeptides) are used as antigen to generate the antibodies. In 
another embodiment, one or more peptides having amino acid sequences 
corresponding to an immunogenic portion of nGPCR-x (e.g., 6, 7, 8, 9, 10, 1 1, 12, 13, 

20 14, 15, 16, 17, 18, 19, 20, or more amino acids) are used as antigen. Peptides 
corresponding to extracellular portions of nGPCR-x, especially hydrophilic 
extracellular portions, are preferred. The antigen may be mixed with an adjuvant or 
linked to a hapten to increase antibody production. 



25 A. Polyclonal or Monoclonal antibodies 

As one exemplary protocol, recombinant nGPCR-x or a synthetic fragment 
thereof is used to immunize a mouse for generation of monoclonal antibodies (or 
larger mammal, such as a rabbit, for polyclonal antibodies). To increase antigenicity, 
peptides are conjugated to Keyhole Lympet Hemocyanin (Pierce), according to the 

30 manufacturer's recommendations. For an initial injection, the antigen is emulsified 

with Freund's Complete Adjuvant and injected subcutaneously. At intervals of two to 
three weeks, additional aliquots of nGPCR-x antigen are emulsified with Freund's 
Incomplete Adjuvant and injected subcutaneously. Prior to the final booster injection, 
a serum sample is taken from the immunized mice and assayed by western blot to 
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confirm the presence of antibodies that immunoreact with nGPCR-x. Serum from the 
immunized animals may be used as polyclonal antisera or used to isolate polyclonal 
antibodies that recognize nGPCR-x. Alternatively, the mice are sacrificed and their 
spleen removed for generation of monoclonal antibodies. 
5 To generate monoclonal antibodies, the spleens are placed in 10 ml serum-free 

RPMI 1640, and single cell suspensions are formed by grinding the spleens in 
serum-free RPMI 1640, supplemented with 2 mM L-glutamine, 1 mM sodium 
pyruvate, 100 units/ml penicillin, and 100 /xg/ml streptomycin (RPMI) (Gibco, 
Canada). The cell suspensions are filtered and washed by centrifugation and 

10 resuspended in serum-free RPMT. Thymocytes taken from three naive Balb/c mice 
are prepared in a similar manner and used as a Feeder Layer. NS-1 myeloma cells, 
kept in log phase in RPMI with 10% fetal bovine serum (FBS) (Hyclone Laboratories, 
Inc., Logan, Utah) for three days prior to fusion, are centrifuged and washed as well. 
To produce hybridoma fusions, spleen cells from the immunized mice are 

15 combined with NS-1 cells and centrifuged, and the supernatant is aspirated. The cell 
pellet is dislodged by tapping the tube, and 2 ml of 37°C PEG 1500 (50% in 75 mM 
HEPES, pH 8.0) (Boehringer-Mannheim) is stirred into the pellet, followed by the 
addition of serum-free RPMI. Thereafter, the cells are centrifuged, resuspended in 
RPMI containing 15% FBS, 100 fiM sodium hypoxanthine, 0.4 fj.M aminopterin, 16 

20 fiM thymidine (HAT) (Gibco), 25 units/ml IL-6 (Boehringer-Mannheim) and 1.5 x 
10 6 thymocytes/ml, and plated into 10 Corning flat-bottom 96-welI tissue culture 
plates (Corning, Coming New York). 

On days 2, 4, and 6 after the fusion, 100 ill of medium is removed from the 
wells of the fusion plates and replaced with fresh medium. On day 8, the fusions are 

25 screened by ELISA, testing for the presence of mouse IgG that binds to nGPCR-x. 
Selected fusion wells are further cloned by dilution until monoclonal cultures 
producing anti-nGPCR-x antibodies are obtained. 

B. Humanization of anti-nGPCR-x monoclonal antibodies 
30 The expression pattern of nGPCR-x as reported herein and the proven track 

record of GPCRs as targets for therapeutic intervention suggest therapeutic 
indications for nGPCR-x inhibitors (antagonists). nGPCR-x-neutralizing antibodies 
comprise one class of therapeutics useful as nGPCR-x antagonists. Following are 
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protocols to improve the utility of anti-nGPCR-x monoclonal antibodies as 
therapeutics in humans by "humanizing" the monoclonal antibodies to improve their 
serum half-life and render them less immunogenic in human hosts (i.e., to prevent 
human antibody response to non-human anti-nGPCR-x antibodies). 

The principles of humanization have been described in the literature and are 
facilitated by the modular arrangement of antibody proteins. To minimize the 
possibility of binding complement, a humanized antibody of the JgG4 isotype is 
preferred. 

For example, a level of humanization is achieved by generating chimeric 
antibodies comprising the variable domains of non-human antibody proteins of 
interest with the constant domains of human antibody molecules. (See, e.g., Morrison 
et ai, Adv. Immunol., 44:65-92 (1989)). The variable domains of nGPCR-x- 
neutralizmg anti-nGPCR-x antibodies are cloned from the genomic DNA of a B-cell 
hybridoma or from cDNA generated from mRNA isolated from the hybridoma of 
interest. The V region gene fragments are linked to exons encoding human antibody 
constant domains, and the resultant construct is expressed in suitable mammalian host 
cells {e.g., myeloma or CHO cells). 

To achieve an even greater level of humanization, only those portions of the 
variable region gene fragments that encode antigen-binding complementarity 
determining regions ("CDR") of the non-human monoclonal antibody genes are 
cloned into human antibody sequences. (See, e.g., Jones et al, Nature 321:522-525 
(1986); Riechmann et ai, Nature 332:323-327 (1988); Verhoeyen et ai, Science 
239:1534-36 (1988); and Tempest et ai, Bio/Technology 9: 266-71 (1991)). If 
necessary, the £-sheet framework of the human antibody surrounding the CDR3 
regions also is modified to more closely mirror the three dimensional structure of the 
antigen-binding domain of the original monoclonal antibody. (See Kettleborough 
et aL, Protein Engin., 4:773-783 (1991); and Foote et ai, J. Mol. BioL, 224:487-499 
(1992)). 

In an alternative approach, the surface of a non-human monoclonal antibody 
of interest is humanized by altering selected surface residues of the non-human 
antibody, e.g., by site-directed mutagenesis, while retaining all of the interior and 
contacting residues of the non-human antibody. See Padlan, Molecular Immunol., 
28(4/5):489-98 (1991). 
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The foregoing approaches are employed using nGPCR-x-neutralizing 
anti-nGPCR-x monoclonal antibodies and the hybridomas that produce them to 
generate humanized nGPCR-x-neutralizing antibodies useful as therapeutics to treat 
or palliate conditions wherein nGPCR-x expression or ligand-mediated nGPCR-x 
5 signaling is detrimental. 

C. Human nGPCR-x-Neutralizing Antibodies from Phage Display 
Human nGPCR-x-neutralizing antibodies are generated by phage display 
techniques such as those described in Aujame et al, Human Antibodies 8(4): 155- 168 

10 (1997); Hoogenboom, TEBTECH 15:62-70 (1997); and Rader et al., Curr. Opin. 
Biotechnol. 8:503-508 (1997), all of which are incorporated by reference. For 
example, antibody variable regions in the form of Fab fragments or linked single 
chain Fv fragments are fused to the amino terminus of filamentous phage minor coat 
protein pill. Expression of the fusion protein and incorporation thereof into the 

15 mature phage coat results in phage particles that present an antibody on their surface 
and contain the genetic material encoding the antibody. A phage library comprising 
such constructs is expressed in bacteria, and the library is screened for nGPCR- 
x-specific phage-antibodies using labeled or immobilized nGPCR-x as antigen-probe. 

20 D. Human nGPCR-x-neutralizing antibodies from transgenic mice 

Human nGPCR-x-neutralizing antibodies are generated in transgenic mice 
essentially as described in Bruggemann et al., Immunol. Today 17(8):391-97 (1996) 
and Bruggemann et al. t Curr. Opin. Biotechnol. 8:455-58 (1997). Transgenic mice 
carrying human V-gene segments in geimline configuration and that express these 

25 transgenes in their lymphoid tissue are immunized with a nGPCR-x composition 

using conventional immunization protocols. Hybridomas are generated using B cells 
from the immunized mice using conventional protocols and screened to identify 
hybridomas secreting anti-nGPCR-x human antibodies (e.g., as described above). 

30 EXAMPLE 10: ASSAYS TO IDENTIFY MODULATORS OF nGPCR-X 
ACTIVITY 

Set forth below are several nonlimiting assays for identifying modulators 
(agonists and antagonists) of nGPCR-x activity. Among the modulators that can be 
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identified by these assays are natural ligand compounds of the receptor; synthetic 
analogs and derivatives of natural ligands; antibodies, antibody fragments, and/or 
antibody-like compounds derived from natural antibodies or from antibody-like 
combinatorial libraries; and/or synthetic compounds identified by high-throughput 
5 screening of libraries; and the like. All modulators that bind nGPCR-x are useful for 
identifying nGPCR-x in tissue samples {e.g., for diagnostic purposes, pathological 
purposes, and the like). Agonist and antagonist modulators are useful for up- 
regulating and down-regulating nGPCR-x activity, respectively, to treat disease states 
characterized by abnormal levels of nGPCR-x activity. The assays may be performed 
10 using single putative modulators, and/or may be performed using a known agonist in 
combination with candidate antagonists (or visa versa). 
A. cAMP Assays 

In one type of assay, levels of cyclic adenosine monophosphate (cAMP) are 
measured in nGPCR-x-transfected cells that have been exposed to candidate 

15 modulator compounds. Protocols for cAMP assays have been described in the 

literature. (See, e.g., Sutherland et al , Circulation 37: 279 (1968); Frandsen et al., 
Life Sciences 18: 529-541 (1976); Dooley et al., Journal of Pharmacology and 
Experimental Therapeutics 283 (2): 735-41 (1997); and George et al, Journal of 
Biomolecular Screening 2 (4): 235-40 (1997)). An exemplary protocol for such an 

20 assay, using an Adenylyl Cyclase Activation FlashPlate® Assay from NEN™ Life 
Science Products, is set forth below. 

Briefly, the nGPCR-x coding sequence (e.g. , a cDNA or intronless genomic 
DNA) is subcloned into a commercial expression vector, such as pzeoSV2 
(invitrogen), and transiently transfected into Chinese Hamster Ovary (CHO) cells 

25 using known methods, such as the transfection protocol provided by Boehringer- 
Mannheim when supplying the FuGENE 6 transfection reagent. Transfected CHO 
cells are seeded into 96-well microplates from the FlashPlate® assay kit, which are 
coated with solid scintillant to which antisera to cAMP has been bound. For a 
control, some wells are seeded with wild type (untransfected) CHO cells. Other wells 

30 in the plate receive various amounts of a cAMP standard solution for use in creating a 
standard curve. 

One or more test compounds (i.e. , candidate modulators) are added to the cells 
in each well, with water and/or compound-free medium/diluent serving as a control or 
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controls. After treatment, cAMP is allowed to accumulate in the cells for exactly 15 
minutes at room temperature. The assay is terminated by the addition of lysis buffer 
containing f I25 l]-labeled cAMP, and the plate is counted using a Packard Topcount™ 
96-well microplate scintillation counter. Unlabeled cAMP from the lysed cells (or 
from standards) and fixed amounts of [ l25 I]-cAMP compete for antibody bound to the 
plate. A standard curve is constructed, and cAMP values for the unknowns are 
obtained by interpolation. Changes in intracellular cAMP levels of cells in response 
to exposure to a test compound are indicative of nGPCR-x modulating activity. 
Modulators that act as agonists of receptors which couple to the G s subtype of G 
proteins will stimulate production of cAMP, leading to a measurable 3-10 fold 
increase in cAMP levels. Agonists of receptors which couple to the G l/o subtype of G 
proteins will inhibit forskolin-stimulated cAMP production, leading to a measurable 
decrease in cAMP levels of 50-100%. Modulators that act as inverse agonists will 
reverse these effects at receptors that are either constitutively active or activated by 
known agonists. 

B. Aequorin Assays 

In another assay, cells (e.g., CHO cells) are transiently co-transfected with 
both a nGPCR-x expression construct and a construct that encodes the photoprotein 
apoaquorin. In the presence of the cofactor coelenterazine, apoaquorin will emit a 
measurable luminescence that is proportional to the amount of intracellular 
(cytoplasmic) free calcium. (See generally, Cobbold, et al. "Aequorin measurements 
of cytoplasmic free calcium/' In: McCormack J.G. and Cobbold P.H., eds., Cellular 
Calcium: A Practical Approach. Oxford:IRL Press (1991); Stables et al., Analytical 
Biochemistry 252: 1 15-26 (1997); and Haugland, Handbook of Fluorescent Probes 
and Research Chemicals. Sixth edition. Eugene OR: Molecular Probes (1996).) 

Tn one exemplary assay, nGPCR-x is subcloned into the commercial 
expression vector pzeoSV2 (Invitrogen) and transiently co-transfected along with a 
construct that encodes the photoprotein apoaquorin (Molecular Probes, Eugene, OR) 
into CHO cells using the transfection reagent FuGENE 6 (Boehnnger-Mannheim) and 
the transfection protocol provided in the product insert. 

The cells are cultured for 24 hours at 37°C in MEM (Gibco/BRL, 
Gaithersburg, MD) supplemented with 10% fetal bovine serum, 2 mM glutamine, 10 
U/ml penicillin and 10 jig/ml streptomycin, at which time the medium is changed to 
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serum-free MEM containing 5 /xM coelenterazine (Molecular Probes, Eugene, OR). 
Culturing is then continued for two additional hours at 37°C. Subsequently, cells are 
detached from the plate using VERS EN (Gibco/BRL), washed, and resuspended at 
200,000 cells/ml in serum-free MEM. 

Dilutions of candidate nGPCR-x modulator compounds are prepared in serum- 
free MEM and dispensed into wells of an opaque 96-well assay plate at 50 /zlAvell. 
Plates are then loaded onto an MLX microtiter plate luminometer (Dynex 
Technologies, Inc., Chantilly, VA). The instrument is programmed to dispense 50 pii 
cell suspensions into each well, one well at a time, and immediately read 
luminescence for 1 5 seconds. Dose-response curves for the candidate modulators are 
constructed using the area under the curve for each light signal peak. Data are 
analyzed with Slide Write, using the equation for a one-site ligand, and EC 50 values 
are obtained. Changes in luminescence caused by the compounds are considered 
indicative of modulatory activity. Modulators that act as agonists at receptors which 
couple to the G q subtype of G proteins give an increase in luminescence of up to 100 
fold. Modulators that act as inverse agonists will reverse this effect at receptors that 
are either constitutively active or activated by known agonists. 
C. Luciferase Reporter Gene Assay 

The photoprotein luciferase provides another useful tool for assaying for 
modulators of nGPCR-x activity. Cells {e.g., CHO cells or COS 7 cells) are 
transiently co-transfected with both a nGPCR-x expression construct (e.g., nGPCR-x 
in pzeoSV2) and a reporter construct which includes a gene for the luciferase protein 
downstream from a transcription factor binding site, such as the cAMP-response 
element (CRE), AP-1, or NF-kappa B. Agonist binding to receptors coupled to the G s 
subtype of G proteins leads to increases in cAMP, thereby activating the CRE 
transcription factor and resulting in expression of the luciferase gene. Agonist 
binding to receptors coupled to the G q subtype of G protein leads to production of 
diacylglycerol that activates protein kinase C, which activates the AP-1 or NF-kappa 
B transcription factors, in turn resulting in expression of the luciferase gene. 
Expression levels of luciferase reflect the activation status of the signaling events. 
(See generally, George et a/., Journal of Biomolecular Screening 2(4); 235-240 
(1997); and Stratowa et al. y Current Opinion in Biotechnology 6: 574-581 (1995)). 
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Luciferase activity may be quantitatively measured using, e.g., luciferase assay 
reagents that are commercially available from Promega (Madison, WI). 

In one exemplary assay, CHO cells are plated in 24-well culture dishes at a 
density of 100,000 cells/well one day prior to transfection and cultured at 37°C in 
5 MEM (Gibco/BRL) supplemented with 10% fetal bovine serum, 2 mM glutamine, 10 
U/ml penicillin and 10 fxg/m\ streptomycin. Cells are transiently co-transfected with 
both a nGPCR-x expression construct and a reporter construct containing the 
luciferase gene. The reporter plasmids CRE-luciferase, AP-1 -luciferase and NF- 
kappaB-luciferase may be purchased from Stratagene (LaJolla, CA). Transfections 

10 are performed using the FuGENE 6 transfection reagent (Boehringer-Mannheim) 

according to the supplier's instructions. Cells transfected with the reporter construct 
alone are used as a control. Twenty-four hours after transfection, cells are washed 
once with PBS pre-warmed to 37°C. Serum-free MEM is then added to the cells 
either alone (control) or with one or more candidate modulators and the cells are 

15 incubated at 37°C for five hours. Thereafter, cells are washed once with ice-cold PBS 
and lysed by the addition of 100 fi\ of lysis buffer per well from the luciferase assay 
kit supplied by Promega. After incubation for 15 minutes at room temperature, 15 [i\ 
of the lysate is mixed with 50 fx\ of substrate solution (Promega) in an opaque-white, 
96-well plate, and the luminescence is read immediately on a Wallace model 1450 

20 MicroBeta scintillation and luminescence counter (Wallace Instruments, 
Gaithersburg, MD). 

Differences in luminescence in the presence versus the absence of a candidate 
modulator compound are indicative of modulatory activity. Receptors that are either 
constitutively active or activated by agonists typically give a 3 to 20-fold stimulation 

25 of luminescence compared to cells transfected with the reporter gene alone. 
Modulators that act as inverse agonists will reverse this effect. 



Changes in intracellular calcium levels are another recognized indicator of G 
protein-coupled receptor activity, and such assays can be employed to screen for 
30 modulators of nGPCR-x activity. For example, CHO cells stably transfected with a 
nGPCR-x expression vector are plated at a density of 4 x 10 4 cells/well in Packard 
black-walled, 96-well plates specially designed to discriminate fluorescence signals 
emanating from the various wells on the plate. The cells are incubated for 60 minutes 
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Intracellular calcium measurement using FLIPR 
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at 37°C in modified Dulbecco's PBS (D-PBS) containing 36 mg/L pyruvate and 1 g/L 
glucose with the addition of 1% fetal bovine serum and one of four calcium indicator 
dyes (Fluo-3™ AM, Fluo-4™ AM, Calcium Green™- 1 AM, or Oregon Green™ 488 
BAPTA-1 AM), each at a concentration of 4 fiM. Plates are washed once with 
5 modified D-PBS without 1% fetal bovine serum and incubated for 10 minutes at 37°C 
to remove residual dye from the cellular membrane. In addition, a series of washes 
with modified D-PBS without 1% fetal bovine serum is performed immediately prior 
to activation of the calcium response. 

A calcium response is initiated by the addition of one or more candidate 

10 receptor agonist compounds, calcium ionophore A23187 (10 (jM; positive control), or 
ATP (4 /iM; positive control). Fluorescence is measured by Molecular Device's 
FLEPR with an argon laser (excitation at 488 nm). (See, e.g., Kuntzweiler et al. y Drug 
Development Research, 44(1): 14-20 (1998)). The F-stop for the detector camera was 
set at 2.5 and the length of exposure was 0.4 milliseconds. Basal fluorescence of cells 

15 was measured for 20 seconds prior to addition of candidate agonist, ATP, or A23187, 
and the basal fluorescence level was subtracted from the response signal. The 
calcium signal is measured for approximately 200 seconds, taking readings every two 
seconds. Calcium ionophore A23187 and ATP increase the calcium signal 200% 
above baseline levels. In general, activated GPCRs increase the calcium signal 

20 approximately 10-15% above baseline signal. 
E. Mitogenesis Assay 

In a mitogenesis assay, the ability of candidate modulators to induce or inhibit 
nGPCR-x -mediated cell division is determined. (See, e.g., Lajiness et al. 9 Journal of 
Pharmacology and Experimental Therapeutics 267(3): 1573-1581 (1993)). For 

25 example, CHO cells stably expressing nGPCR-x are seeded into 96-well plates at a 
density of 5000 cells/well and grown at 37°C in MEM with 10% fetal calf serum for 
48 hours, at which time the cells are rinsed twice with serum-free MEM. After 
rinsing, 80 of fresh MEM, or MEM containing a known mitogen, is added along 
with 20 ^il MEM containing varying concentrations of one or more candidate 

30 modulators or test compounds diluted in serum-free medium. As controls, some wells 
on each plate receive serum-free medium alone, and some receive medium containing 
10% fetal bovine serum. Untransfected cells or cells transfected with vector alone 
also may serve as controls. 
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Afterculture for 16-18 hours, 1 /iCi of [ 3 H]-thymidine (2 Ci/mmol) is added to 
the wells and cells are incubated for an additional 2 hours at 37°C. The cells are 
trypsinized and collected on filter mats with a cell harvester (Tomtec); the filters are 
then counted in a Betaplate counter. The incorporation of [ 3 H]-thymidine in serum- 
5 free test wells is compared to the results achieved in cells stimulated with serum 

(positive control). Use of multiple concentrations of test compounds permits creation 
and analysis of dose-response curves using the non-linear, least squares fit equation: 
A = Bx [C/ (D + C)] + G where A is the percent of serum stimulation; B is the 
maximal effect minus baseline; C is the EC50; D is the concentration of the 
10 compound; and G is the maximal effect. Parameters B, C and G are determined by 
Simplex optimization. 

Agonists that bind to the receptor are expected to increase [ 3 H]-thymidine 
incorporation into cells, showing up to 80% of the response to serum. Antagonists 
that bind to the receptor will inhibit the stimulation seen with a known agonist by up 
15 to 100%. 

F. r 35 S1GTP-vS Binding Assay 

Because G protein-coupled receptors signal through intracellular G proteins 
whose activity involves GTP binding and hydrolysis to yield bound GDP, 
measurement of binding of the non-hydrolyzable GTP analog [ 35 S]GTPtS in the 

20 presence and absence of candidate modulators provides another assay for modulator 
activity. (See, e.g., Kowal et ai, Neuropharmacology 37:179-187 (1998).) 

In one exemplary assay, cells stably transfected with a nGPCR-x expression 
vector are grown in 10 cm tissue culture dishes to subconfluence, rinsed once with 5 
ml of ice-cold Ca 2+ /Mg 2+ -free phosphate-buffered saline, and scraped into 5 ml of the 

25 same buffer. Cells are pelleted by centrifugation (500 x g, 5 minutes), resuspended in 
TEE buffer (25 mM Tns, pH 7.5 , 5 mM EDTA, 5 mM EGTA), and frozen in liquid 
nitrogen. After thawing, the cells are homogenized using a Dounce homogenizer (one 
ml TEE per plate of cells), and centrifuged at 1,000 x g for 5 minutes to remove 
nuclei and unbroken cells. 

30 The homogenate supernatant is centrifuged at 20,000 x g for 20 minutes to 

isolate the membrane fraction, and the membrane pellet is washed once with TEE and 
resuspended in binding buffer (20 mM HEPES, pH 7.5, 150 mM NaCl, 10 mM 
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MgCl 2 , 1 mM EDTA). The resuspended membranes can be frozen in liquid nitrogen 
and stored at -70°C until use. 

Aliquots of cell membranes prepared as described above and stored at -70°C 
are thawed, homogenized, and diluted into buffer containing 20 mM HEPES, 10 mM 
MgCl 2 , 1 mM EDTA, 120 mM NaCl, 10 fiM GDP, and 0.2 mM ascorbate, at a 
concentration of 10-50 /xg/ml. In a final volume of 90 fi\, homogenates are incubated 
with varying concentrations of candidate modulator compounds or 100 fxM GTP for 
30 minutes at 30°C and then placed on ice. To each sample, 10 /zl guanosine S'-O- 
(3[ 35 S]thio) triphosphate (NEN, 1200 Ci/mmol; [ 35 S]-GTP->5), was added to a final 
concentration of 100-200 pM. Samples are incubated at 30°C for an additional 30 
minutes, 1 ml of 10 mM HEPES, pH 7.4, 10 mM MgCl 2 , at 4°C is added and the 
reaction is stopped by filtration. 

Samples are filtered over Whatman GF/B filters and the filters are washed 
with 20 ml ice-cold 10 mM HEPES, pH 7.4, 10 mM MgCI 2 . Filters are counted by 
liquid scintillation spectroscopy. Nonspecific binding of [ 35 S]-GTPtS is measured in 
the presence of 100 ^M GTP and subtracted from the total. Compounds are selected 
that modulate the amount of [ 35 S]-GTPtS binding in the cells, compared to 
untransfected control cells. Activation of receptors by agonists gives up to a five-fold 
increase in [ 35 S]GTPtS binding. This response is blocked by antagonists. 

G. MAP Kinase Activity Assay 

Evaluation of MAP kinase activity in cells expressing a GPCR provides 
another assay to identify modulators of GPCR activity. (See, e.g., Lajiness et al., 
Journal of Pharmacology and Experimental Therapeutics 267(3): 1573-1 581 (1993) 
and Boulton et al y Cell 65:663-675 (1991).) 

In one embodiment, CHO cells stably transfected with nGPCR-x are seeded 
into 6-well plates at a density of 70,000 cells/well 48 hours prior to the assay. During 
this 48-hour period, the cells are cultured at 37°C in MEM medium supplemented 
with 10% fetal bovine serum, 2 mM glutamine, 10 U/ml penicillin and 10 /ig/ml 
streptomycin. The cells are serum-starved for 1-2 hours prior to the addition of 
stimulants. 

For the assay, the cells are treated with medium alone or medium containing 
either a candidate agonist or 200 nM Phorbol ester- myristoyl acetate (i.e., PMA, a 
positive control), and the cells are incubated at 37°C for varying times. To stop the 
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reaction, the plates are placed on ice, the medium is aspirated, and the cells are rinsed 
with 1 ml of ice-cold PBS containing 1 mM EDTA. Thereafter, 200 fx] of cell lysis 
buffer (12.5 mM MOPS, pH 7.3, 12.5 mM glycerophosphate, 7.5 mM MgCl 2 , 0.5 mM 
EGTA, 0.5 mM sodium vanadate, 1 mM benzamidine, 1 mM dithiothreitol, 10 iiz/m\ 

5 leupeptin, 10 }JLgJm\ aprotinin, 2 /ig/ml pepstatin A, and 1 ^M okadaic acid) is added 
to the cells. The cells are scraped from the plates and homogenized by 10 passages 
through a 23 3/4 G needle, and the cytosol fraction is prepared by centrifugation at 
20,000 x g for 15 minutes. 

Aliquots (5-10 fi\ containing 1-5 fig protein) of cytosol are mixed with 1 mM 

10 MAPK Substrate Peptide (APRTPGGRR (SEQ ID NO: 129), Upstate Biotechnology, 
Inc., N.Y.) and 50 [t- 32 P]ATP (NEN, 3000 Ci/mmol), diluted to a final specific 
activity of -2000 cpm/pmol, in a total volume of 25 /xl. The samples are incubated 
for 5 minutes at 30°C, and reactions are stopped by spotting 20 fi\ on 2 cm 2 squares of 
Whatman P81 phosphocellulose paper. The filter squares are washed in 4 changes of 

15 1 % H3PO4, and the squares are subjected to liquid scintillation spectroscopy to 

quantitate bound label. Equivalent cytosolic extracts are incubated without MAPK 
substrate peptide, and the bound label from these samples are subtracted from the 
matched samples with the substrate peptide. The cytosolic extract from each well is 
used as a separate point. Protein concentrations are determined by a dye binding 

20 protein assay (Bio-Rad Laboratories). Agonist activation of the receptor is expected 
to result in up to a five-fold increase in MAPK enzyme activity. This increase is 
blocked by antagonists. 

H. [ 3 HlArachidonic Acid Release 

The activation of GPCRs also has been observed to potentiate arachidonic acid 
25 release in cells, providing yet another useful assay for modulators of GPCR activity. 
(See, e.g., Kanterman et al., Molecular Pharmacology 39:364-369 (1991).) For 
example, CHO cells that are stably transfected with a nGPCR-x expression vector are 
plated in 24-well plates at a density of 15,000 cells/well and grown in MEM medium 
supplemented with 10% fetal bovine serum, 2 mM glutamine, 10 U/ml penicillin and 
30 10 /xg/ml streptomycin for 48 hours at 37°C before use. Cells of each well are labeled 
by incubation with [ 3 H]-arachidonic acid (Amersham Corp., 210 Ci/mmol) at 0.5 
jiCi/m\ in 1 ml MEM supplemented with 10 mM HEPES, pH 7.5, and 0.5% fatty- 



141 




WO 01/36473 PCT/USO0/3158I 

acid-free bovine serum albumin for 2 hours at 37°C. The cells are then washed twice 
with 1 ml of the same buffer. 

Candidate modulator compounds are added in 1 ml of the same buffer, either 
alone or with 10 /zM ATP and the cells are incubated at 37°C for 30 minutes. Buffer 
5 alone and mock-transfected cells are used as controls. Samples (0.5 ml) from each 
well are counted by liquid scintillation spectroscopy. Agonists which activate the 
receptor will lead to potentiation of the ATP-stimulated release of [ 3 H]-arachidonic 
acid. This potentiation is blocked by antagonists. 
I. Extracellular Acidification Rate 

10 In yet another assay, the effects of candidate modulators of nGPCR-x activity 

are assayed by monitoring extracellular changes in pH induced by the test 
compounds. (See, e.g., Dunlop et a!., Journal of Pharmacological and Toxicological 
Methods 40(l):47-55 (1998).) In one embodiment, CHO cells transfected with a 
nGPCR~x expression vector are seeded into 12 mm capsule cups (Molecular Devices 

15 Corp.) at 4 x 10 5 cells/cup in MEM supplemented with 10% fetal bovine serum, 2 
mM L-glutamine, 10 U/ml penicillin, and 10 /xg/ml streptomycin. The cells are 
incubated in this medium at 37°C in 5% C0 2 for 24 hours. 

Extracellular acidification rates are measured using a Cytosensor 
microphysiometer (Molecular Devices Corp.). The capsule cups are loaded into the 

20 sensor chambers of the microphysiometer and the chambers are perfused with running 
buffer (bicarbonate-free MEM supplemented with 4 mM L-glutamine, 10 units/ml 
penicillin, 10 pig/m\ streptomycin, 26 mM NaCl) at a flow rate of 100 ^il/minute. 
Candidate agonists or other agents are diluted into the running buffer and perfused 
through a second fluid path. During each 60-second pump cycle, the pump is run for 

25 38 seconds and is off for the remaining 22 seconds. The pH of the running buffer in 
the sensor chamber is recorded during the cycle from 43-58 seconds, and the pump is 
re-started at 60 seconds to start the next cycle. The rate of acidification of the running 
buffer during the recording time is calculated by the Cytosoft program. Changes in 
the rate of acidification are calculated by subtracting the baseline value (the average 

30 of 4 rate measurements immediately before addition of a modulator candidate) from 
the highest rate measurement obtained after addition of a modulator candidate. The 
selected instrument detects 61 mV/pH unit. Modulators that act as agonists of the 
receptor result in an increase in the rate of extracellular acidification compared to the 
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rate in the absence of agonist. This response is blocked by modulators which act as 
antagonists of the receptor. 

EXAMPLE 1 1 : IN SITU HYBRIDIZATION 
5 DNA Probe Preparation For nGPCR-11, -16, -40, -54, and -56 

DNA probes for in situ hybridization were prepared as follows. Two sets of 
primer pairs were prepared. The first set has the sequence for T7 polymerase 
promoter on the 5' primer to make the sense RNA, and the second set has the T7 
polymerase promoter sequence on the 3' primer to make the antisense RNA. PCR 

10 was performed in a 50 ul reaction containing 36.5 ul H 2 0, 5ul lOxTT buffer (140 
mM Ammonium Sulfate, 0.1 % gelatine, 0.6 M Tris-tricine pH 8.4), 5 ul 25mM 
MgCl 2 , 2 ul 10 mM dNTP, 0.4 ul Incyte clone 1722192 DNA, 0.5 ul AmpliTaq (PE 
Applied Biosystems), and 0.3 ul oligol (1 mg/ml) and 0.3 ul oligo2 (lmg/ml)[to 
make the sense RNA], or 0.3 ul oligo3 (1 mg/ml) and 0.3 ul oligo4 (lmg/ml)[to make 

15 the antisense RNA]. The PCR reaction involved one cycle at 94°C for 2 min followed 
by 35 cycles at 94°C for 30 sec, 60°C for 30 sec, 72°C for 30 sec. The two PCR 
reactions were loaded onto a 1.2 % agarose gel. The DNA band was excised from the 
gel, placed in a GenElute Agarose spin column (Supelco) and spun for 10 min at 
maximum speed. The eluted DNA was EtOH precipitated and resuspended in 

20 transcription buffer. The primer sequences for each nGPCR tested are listed below. 
For nGPCR- 1 1, the sense primers were: 
GCGTAATACGACTCACTATAGGGAGACCGCGTGTCTGCTAGACTCTATTTC 
C 3'(LW1658)(SEQIDNO: 159), and: 

5* TGCCACACTGATGCAACTCC 3' (LW1661) (SEQ ID NO: 160). The antisense 
25 primers were: 

GCGTAATACGACTCACTATAGGGAGACCTGCCACACTGATGCAACTCC 
(LW1659) SEQ ID NO: 161) and. 

5 > GCGTGTCTGCTAGACTCTATTTCC3 , (LW1660)(SEQIDNO: 162). The 
primer pairs yielded a product of 275bp. 
30 For nGPCR- 1 6, the sense primers were: 

5'GCGTAATACGACTCACTATAGGGAGACCGCACGCCACTCTTTACTATCC 
C (LW1645) (SEQ ID NO: 163), and: 
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5' GCACAAAACACAATTCCATAAGCC 3' (LW1648) (SEQ ID NO: 164). The 
antisense primers were: 

5'GCGTAATACGACTCACTATAGGGAGACCGCACAAAACACAATTCCATAA 
GCC 3* (LW1646) (SEQ ID NO: 165), and: 
5 5* GCTACGCCACTCTTTACTATCCC 3'(LW1647) (SEQ ID NO: 166). The 
pnmer pairs yielded a product of 283 bp. 

For nGPCR-40, the sense primers were: 
5'GCGTAATACGACTCACTATAGGGAGACCTTATGAGCAGCAATTCATCCC 
3'(LW1704) (SEQ ID NO: 167), and: 
10 5'CACACCCACCAAGAAATCAG 3'(LW1707)(SEQ ID NO: 168). The antisense 
primers were: 

5'GCGTAATACGACTCACTATAGGGAGACCCACACCCACCAAGAAATCAG 
3'(LW1705) (SEQ ID NO: 169), and: 

5' TTATGAGCAGCAATTCATCCC 3' (LW1706) (SEQ ID NO: 170). The primer 
15 pairs yielded a product of 25 1 bp. 

For nGPCR-54, the sense primers were: 
5 ' GCGT AAT ACG ACTC ACT AT AGGG AG ACCCG ATTATCC AC ACTTTG ACCC 
3 1 (LWI803) (SEQ ID NO: 171), and: 

5' CTGAAAGTTGTCGCTGACC 3' (LW1634) (SEQ ID NO: 172). The anti-sense 
20 primers were: 

GCGTAATACGACTCACTATAGGGAGACCCTGCTGAAAGTTGTCGCTGACC 
3' (LW1804)(SEQ ID NO: 1 73), and: 

5' CGATTATCCACACTTTGACCC 3' (LW1635) (SEQ TD NO: 174). The primer 
pairs yielded a product of 286 bp. 
25 For nGPCR-56, the sense primers were: 

GCGTAATACGACTCACTATAGGGAGACCCTGTAAAATTCACACAAGCACC 
3' (LW1763) (SEQ ID NO: 175), and: 

5 T AGAAGACAGAGCAACCTCC 3' (LW1 766) (SEQ ID NO: 1 76). The anti-sense 
primers were: 

30 GCGTAATACGACTCACTATAGGGAGACCAGAAGACAGAGCAACCTCC 
(LW1764) (SEQ ID NO: 177) and: 

CTGTAAAATTCACACAAGCACC (LW1765) (SEQ ID NO: 178). The pnmer 
pairs yielded a product of 272 bp. 
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DNA Probe Preparation For nGPCR-1 

Probes for nGPCR-1 were prepared as above with the following 
modifications. Using a sense primer: 

GCATGGATCCTCTTTGCTGTATTTCACCCTC) (LW1595) (SEQ ED NO: 179) 
5 and an antisense primer: 

5 'GCATGAATTCACAATGCC AGTG ATAAGGAAG 3' (LW1596) (SEQ ID NO: 
1 80), a 271 bp fragment was generated by PCR. The fragment was digested with 
BamHI and EcoRI and ligated into a Bluescriptll vector that had been cut with BamHI 
and EcoRI. The orientation of the insert was such that T7 polymerase generates the 
10 anti-sense strand and T3 polymerase generates the sense strand. 
Histochemistry 

Coronal and sagittal oriented rat brain sections were cryosectioned (20 um 
thick) using a Reichert-Jung cryostat. The individual sections were thaw-mounted 
onto silanated, nuclease-free slides (CEL Associates, Inc., Houston, TX), and stored 

15 at -80°C. The sections were processed starting with post-fixation in cold 4% 
paraformaldehyde, rinsed in cold PBS, acetylated using acetic anhydride in 
triethanol amine buffer and dehydrated through 70%, 95%, and 100% alcohols at room 
temperature (RT). This was followed with delipidation in chloroform then 
rehydration in 100% and 95% alcohol at room temperature. Sections were air-dried 

20 prior to hybridization. Two PCR fragments (— 250 bp) were generated, one that 

contained T7 polymerase on the 5' end (sense) and the other with T7 polymerase on 
the 3' end (antisense). The PCR fragments were labeled with 35 S-UTP to yield a 
specific activity of 0.655 x 10 6 cpm/pmol for antisense and 0.675 x 10 6 cpm/pmol for 
sense probe. Both riboprobes were denatured and added to hybridization buffer 

25 containing 50% formamide, 10% dextran, 03M NaCl, 10 mM Tris, 1 mM EDTA, IX 
Denhardts, and 10 mM DTT. Sequential brain cryosections were hybridized with 45 
|il/slide of the sense and antisense riboprobe hybridization mixture, then covered with 
silanized glass coverslips. The sections were hybridized overnight (15-18 hrs) at 
42°C in an incubator. 

30 Coverslips were washed off the slides in IX SSC, followed by RNase A 

treatment, and high temperature stringency washes (3X, 20 mins at 41 °C) in 0.1X 
SSC. Slides were dehydrated with 70%, 95% NH 4 OAc, and 100% NH 4 OAc alcohols, 
air-dried and exposed to Kodak BioMax MR-1 film. After 9 days of exposure, the 
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film was developed. This was followed with coating selected tissue slides with 
Kodak NTB-2 nuclear track emulsion and storing the slides in the dark for 23 days. 
The slides were then developed and counterstained with hematoxylin. Emulsion- 
coated sections were analyzed microscopically to determine the specificity of 
5 labeling. Presence of autoradiographic grains (generated by antisense probe 

hybridization) over cell bodies (versus between cell bodies) was used as an index of 

specific hybridization. 

Results 

In situ hybridization results indicated localization in the following brain areas: 
10 nGPCR-1 was localized to the dentate gyrus of hippocampus, piriform cortex, 

and red nucleus. 

nGPCR-1 1 was localized to the piriform cortex, hippocampus, red nucleus, 
subthalamic nuclei, dorsal raphe, interpeduncular nucleus, and habenula. nGPCR-1 6 
was localized to the cortex, piriform cortex, hippocampus, thalamus, subthalamic 

15 nuclei, hypothalamus, bed nucleus stria terminalis and posterior striatum. nGPCR-40 
was localized to the cortex, piriform cortex, hippocampus, substantia nigra compacta, 
hypothalamus, laterial septus, bed nucleus stria terminalis, thalamus, ventral 
tegmental area, interpeduncular nucleus, dorsal raphe, medical geniculate, islands of 
Calleja, subthamalmic nuclei, choroid plexus. nGPCR-54 was localized to the 

20 piriform cortex and hippocampus, including the dentate gyrus, CA1 and CA3. 

nGPCR-56 was localized to the piriform cortex, cortex, interpeduncular nuceus, red 
nucleus, hippocampus, habenula, substantia nigra pars compacta, mamillary body 
stria terminalis,hypothalamus, subthamalmic nuclei, corsal raphe, and ventral 
tegmental area. 

25 

EXAMPLE 12: CHROMOSOMAL LOCALIZATION 
Methods 

Chromosomal location of the genes encoding nGPCRs was determined using 
the Stanford G3 Radiation Hybrid Panel (Research Genetics, Inc., Huntsville, AL). 
30 This panel contains 83 radiation hybrid clones of the entire human genome created by 
the Stanford Human Genome Center. PCR reactions were assembled containing 25ng 
of DNA from each clone and the components of the Expand Hi-Fi PCR System™ 
(Roche Molecular Biochemicals, Indianapolis, IN) in a final reaction volume of 15 ul. 
PCR primers were synthesized by Genosys Corp., The Woodlands, TX. PCR 



146 




W O 01/36473 PCT/US00/31581 

reactions were incubated in a GeneAmp 9700 PCR thermocycler (Perkin Elmer 
Applied Biosystems). The following cycling program was executed: Pre-soak at (94° 
for 3min.)(94° for 30 sec.)(52°C for 60 sec.)(72° for 2 min.)] for 35 cycles. PCR 
reaction products were then separated and analyzed by electrophoresis on a 2.0% 
5 agarose gel, and stained with ethidium bromide. Lanes were scored for the presence 
or absence of the expected PCR product and the results submitted to the Stanford 
Human Genome Center via e-mail for analysis (http:/Avww- 
shgc. stan ford.edu/RH/rhserverformnew. html). 
nGPCR-40 

10 PCR primers were designed based on the available sequence of the Celera 

sequence HUM_IDS|Contig]l 10002581 15466. The forward primer used was: 
5 > ACAGCCCCAAAGCCAAACAC3 T (SEQIDNO: 181). The reverse 
primer was: 

5'CCGCAGGAGCAATG-AAAATCAG3' (SEQ ID NO: 182). This primer 
15 set will prime the synthesis of a 220 base pair fragment in the presence of the 
appropriate genomic DNA, 

G3 Radiation Hybrid Panel Analysis places nGPCR-40 on chromosome 6, 
most nearly linked to Stanford marker SHGC- 1836 with a LOD score of 1 1.84. This 
marker lies at position 6q21. In a genome scanning data set, Cao et al. (Genomics 
20 1997 Jul 1:43(1): 1-8) found excess allele sharing for markers on 6ql3-q26. Greatest 
allele sharing was at interval 6q21-q22.3 with a maximum multipoint MLS value of 
3.06 close to marker D6S278. Replication data from a second data set found 
maximum multipoint MLS at the interval D6S424-D6S275. These results provide 
suggestive evidence for a susceptibility locus for schizophrenia in chromosome 6q 
25 from two independent data sets. 
nGPCR-54 

PCR primers were designed based on the available sequence of the Celera 
sequence GA 1 1824020. The forward primer used was: 

5'CTGTCTCTCTGTCCTCTTCC3\ (SEQ ED NO: 183). The reverse primer 
30 used was: 

5'GCACCGATCTTCATTGAATTTC3\ (SEQ ID NO: 184). This primer set 
will prime the synthesis of a 145 base pair fragment in the presence of the appropriate 
genomic DNA. 
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G3 Radiation Hybrid Panel Analysis places nGPCR-54 on chromosome 13, 
most nearly linked to Stanford marker SHGC-68276 with a LOD score of 6.31 . This 
marker lies at position 13q32. Numerous investigations have found significant 
suggestion of linkage of schizophrenia to this region of chromosome 13q32. See, for 
example, Brzustowicz et aL y Am J Hum Genet 1999 Oct; 65(4): 1096-1 103; Blouin et 
aL, Nat Genet 1998 Sep; 20(1): 70-3; Shaw et al., Am J Med Genet. 1998 Sep 7; 
81(5): 364-76; Lin et aL, Hum Genet 1997 Mar; 99(3): 417-20; Pulver et at., Cold 
Spring Harb Symp Quant Biol 1996; 61 :797-814. 

Genes localized to chromosomal regions in linkage with schizophrenia are 
candidate genes for disease susceptibility. Genes in these regions with the potential to 
play a biochemical/functional role in the disease process (like G protein coupled 
receptors) have a high probability of being a disease-modifying locus. nGPCR-40 
and -54, because of their chromosomal location, are attractive targets therefore for 
screening ligands useful in modulating cellular processes involved in schizophrenia. 

EXAMPLE 13: CLONE DEPOSIT INFORMATION 

In accordance with the Budapest Treaty, clones of the present invention have 
been deposited at the Agricultural Research Culture Collection (NRRL) International 
Depository Authority, 1815 N. University Street, Peoria, Illinois 61604, U.S.A. 
Accession numbers and deposit dates are provided below in Table 6. 



148 




WO01/36473 PCT/US00/315XI 



Table 6: DEPOSIT INFORMATION 



Clone 


Accession Number 
NRRL 


Budapest Treaty Deposit Date 








nGPCR-1 (SEQ ID NO:73) 


B-30243 


2000 Jan 18 


nGPCR -5 (SEQ ID NO: 75) 


B-30244 


2000 Jan 18 


nGPCR-16<SEQ ID NO: 81) 


B-30245 


2000 Jan 18 


nGPCR -11 (SEQ ID NO: 79) 


B-30258 


2000 Feb 02 


nGPCR -17 (SEQ ID NO: 23) 


B-30259 


2000 Feb 03 


nGPCR -9 (SEQ ID NO: 77) 


B-30262 


2000 Feb 22 


nGPCR -58 (SEQ ID NO: 91) 


B-30274 


2000 March 23 


nGPCR -56 (SEQ ID NO: 89) 


B-30288 


2000 May 5 


nGPCR -3 (SEQ ID NO: 185) 


B-30290 


2000 May 5 


nGPCR -54 (SEQ ID NO: 85) 


B-30291 


2000 May 5 


nGPCR -40 (SEQ ID NO: 83*) 


B-30299N 


2000 June 02 



* The clone deposited with NRLL Accession Number B30299N comprises a 
5 sequence identical to SEQ ID NO:83 but with the substitution of an "A" at nucleotide 
position 10. 

Example 14 - Using nGPCR-x proteins to isolate neurotransmitters 

10 The isolated nGPCR-x proteins, particularly nGPCR-1, nGPCR-3, nGPCR-9, 

nGPCR-1 1, nGPCR- 16, nGPCR-40, nGPCR-54, nGPCR-56, and nGPCR-58, (SEQ 
ID NOS: SEQ ID NO: 2, SEQ ID NO: 74; SEQ ID NO: 4, SEQ ID NO: 186; SEQ ID 
NO.10, SEQ ID NO:78; SEQ ID NO:12, SEQ ID NO:80; SEQ ID NO: 22, SEQ ID 
NO:S2, SEQ ID NO:54, SEQ ID NO:84; SEQ ID NO:60, SEQ ID NO: 86; SEQ ID 

15 NO:64, SEQ ID NO: 88, SEQ ID NO:90; SEQ ID NO:68, SEQ ID NO: 92, and SEQ 
ID NO:94, respectively) can be used to isolate novel or known neurotransmitters 
(Saito et a/., Nature 400: 265-269, 1999). The cDNAs that encode the isolated 
nGPCR-x can be cloned into mammalian expression vectors and used to stably or 
transiently transfect mammalian cells including CHO, Cos or HEK293 cells. 

20 Receptor expression can be determined by Northern blot analysis of transfected cells 
and identification of an appropriately sized mRNA band (predicted size from the 
cDNA). Brain regions shown by mRNA analysis to express each of the nGPCR-x 
proteins could be processed for peptide extraction using any of several protocols 
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((Reinsheidk R.K. et al., Science 270: 243-247, 1996; Sakurai, T., et al., Cell 92; 573- 
585, 1998; Hinuma, S., et al., Nature 393: 272-276, 1998). Chromoto graphic 
fractions of brain extracts could be tested for ability to activate nGPCR-x proteins by 
measuring second messenger production such as changes in cAMP production in the 
presence or absence of forskolin, changes in inositol 3-phosphate levels, changes in 
intracellular calcium levels or by indirect measures of receptor activation including 
receptor stimulated mitogenesis, receptor mediated changes in extracellular 
acidification or receptor mediated changes in reporter gene activation in response to 
cAMP or calcium (these methods should all be referenced in other sections of the 
patent). Receptor activation could also be monitored by co-transfecting cells with a 
chimeric GI q /j 3 to force receptor coupling to a calcium stimulating pathway (Conklin 
et al., Nature 363; 274-276, 1993). Neurotransmitter mediated activation of receptors 
could also be monitored by measuring changes in [ 35 S]-GTPKS binding in 
membrane fractions prepared from transfected mammalian cells. This assay could 
also be performed using baculoviruses containing nGPCR-x proteins infected into 
SF9 insect cells. 

The neurotransmitter which activates nGPCR-x proteins can be purified to 
homogeneity through successive rounds of purification using nGPCR-x proteins 
activation as a measurement of neurotransmitter activity. The composition of the 
neurotransmitter can be determined by mass spectrometry and Edman degradation if 
peptidergic. Neurotransmitters isolated in this manner will be bioactive materials 
which will alter neurotransmission in the central nervous system and will produce 
behavioral and biochemical changes. 

Example 15 - Using nGPCR-x proteins to isolate and purify G proteins 

cDNAs encoding nGPCR-x proteins are epitope-tagged at the amino 
terminuus end of the cDNA with the cleavable influenza-hemagglutinin signal 
sequence followed by the FLAG epitope (IBT, New Haven, CT). Additionally, these 
sequences are tagged at the carboxyl terminus with DNA encoding six histidine 
residues. (Amino and Carboxyl Terminal Modifications to Facilitate the Production 
and Purification of a G Protein-Coupled Receptor, B.K. Kobilka , Analytical 
Biochemistry, Vol. 231, No. 1, Oct 1995, pp. 269-271). The resulting sequences are 
cloned into a baculovirus expression vector such as pVL1392 (Invitrogen). The 
baculovirus expression vectors are used to infect SF-9 insect cells as described (Guan, 
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X. M, Kobilka, T. S., and Kobilka, B. K. (1992) J. Biol. Chem. 267, 21995-21998). 
Infected SF-9 cells could be grown in 1000-ml cultures in SF900 II medium (Life 
Technologies, Inc.) containing 5% fetal calf serum (Gemini, Calabasas, CA) and 0.1 
mg/ml gentamicin (Life Technologies, Inc.) for 48 hours at which time the cells could 
be harvested. Cell membrane preparations could be separated from soluble proteins 
following cell lysis. nGPCR-x protein purification is carried out as described for 
purification of the 92 receptor (Kobilka, Anal. Biochem., 231 (1): 269-271, 1995) 
including solubilization of the membranes in 0.8-1 .0 % n-dodecyl -D-maltoside (DM) 
(CalBiochem, La Jolla, CA) in buffer containing protease inhibitors followed by Ni- 
column chromatography using chelating Sepharose™ (Pharmacia, Uppsala, Sweden). 
The eluate from the Ni-column is further purified on an Ml anti-FLAG antibody 
column (IBI). Receptor containing fractions are monitored by using receptor specific 
antibodies following western blot analysis or by SDS-PAGE analysis to look for an 
appropriate sized protein band (appropriate size would be the predicted molecular 
weight of the protein). 

This method of purifying G protein is particularly useful to isolate G proteins 
that bind to the nGPCR-x proteins in the absence of an activating ligand. 

Some of the preferred embodiments of the invention described above are 
outlined below and include, but are not limited to, the following embodiments. As 
those skilled in the art will appreciate, numerous changes and modifications may be 
made to the preferred embodiments of the invention without departing from the spirit 
of the invention. It is intended that all such variations fall within the scope of the 
invention. 

The entire disclosure of each publication cited herein is hereby incorporated 
by reference. 
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What is claimed is: 

1. An isolated nucleic acid molecule comprising a nucleotide sequence that 
5 encodes a polypeptide comprising an amino acid sequence homologous to even 

numbered sequences selected from the group consisting of: SEQ ID NO:2 to SEQ ED 
NO:94, SEQ ID NO: 186, and fragments thereof; said nucleic acid molecule encoding 
at least a portion of nGPCR-x. 

10 2. The isolated nucleic acid molecule of claim 1 comprising a sequence that 

encodes a polypeptide comprising even numbered sequences selected from the group 
consisting of SEQ ID NO:2 to SEQ ID NO:94, SEQ ID NO: 1 86, and fragments 
thereof. 

15 3. The isolated nucleic acid molecule of claim 1 comprising a sequence 

homologous to odd numbered sequences selected from the group consisting of SEQ 
ID NO:l to SEQ ID NO:93, SEQ ID NO: 185 and fragments thereof 

4. The isolated nucleic acid molecule of claim 1 comprising a sequence selected 
20 from the group of odd numbered sequences consisting of SEQ ID NO:l to SEQ ID 

NO: 93, SEQ ID NO: 185 and fragments thereof. 

5. The isolated nucleic acid molecule of claim 4 comprising a sequence selected 
from the group of odd numbered sequences consisting of SEQ ID NO:l to SEQ ID 

25 NO:93 and SEQ ID NO: 185. 

6. The isolated nucleic acid molecule of claim 4 wherein said nucleotide 
sequence is selected from the group consisting of: SEQ ID NO:l, SEQ ID NO:73, 
SEQ ID NO:9, SEQ ID NO:77, SEQ ID NO:l 1, SEQ ID NO:79, SEQ ID NO:21, 

30 SEQ ID NO:81 SEQ ID NO:53, SEQ ID NO:83, SEQ ID NO:59, SEQ ID NO:85, 
SEQ ID NO:63, SEQ ID NO:87, SEQ ID NO:89, SEQ ID NO:67, SEQ ID NO:91, 
SEQ ID NO:93, SEQ ID NO:3, and SEQ ID NO:l 85. 

7. The isolated nucleic acid molecule of claim 4 wherein said nucleotide 

35 sequence is selected from the group consisting of: SEQ ID NO:73, SEQ ID NO:77, 
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SEQ ID NO:79, SEQ ID NO:81 SEQ ID NO:83, SEQ ID NO:85, SEQ ID NO:89, 
SEQ ID NO:93 and SEQ TD NO: 185. 



8. The isolated nucleic acid molecule of claim 1 wherein said nucleic acid 
5 molecule is DNA. 

9. The isolated nucleic acid molecule of claim 1 wherein said nucleic acid 
molecule is RNA. 

10 10. An expression vector comprising a nucleic acid molecule of any one of claims 
1 to 5. 

1 1 . The expression vector of claim 10 wherein said nucleic acid molecule 
comprises a sequence selected from the group of odd numbered sequences consisting 

15 of SEQ TD NO:l to SEQ ID NO:93 and SEQ ID NO: 185. 

12. The expression vector of claim 10 wherein said nucleic acid molecule 
composes a nucleotide sequence selected from the group consisting of: SEQ ID 
NO: 1, SEQ ID NO:73, SEQ ID NO:9, SEQ ID NO:77, SEQ ID NO: 1 1, SEQ ID 

20 NO:79, SEQ ID NO: 21, SEQ ID NO:81 SEQ ID NO:53, SEQ TD NO:83, SEQ ID 
NO:59, SEQ ID NO:85, SEQ ID NO:63, SEQ ID NO:87, SEQ ID NO:89, SEQ ID 
NO:67, SEQ ID NO:91, SEQ ID NO:93, SEQ ID NO: 3, and SEQ ID NO: 185. 

13. The expression vector of claim 10 wherein said nucleotide sequence is 
25 selected from the group consisting of: SEQ ID NO: 73, SEQ ID NO:77, SEQ ID 

NO:79, SEQ ID NO:81 SEQ ID NO:83, SEQ ID NO:85, SEQ ID NO:89, SEQ ID 
NO. 93 and SEQ ID NO: 1 85. 

14. The expression vector of claim 10 wherein said vector is a plasmid. 

30 

15. The expression vector of claim 10 wherein said vector is a viral particle. 

1 6. The expression vector of claim 1 5 wherein said vector is selected from the 
group consisting of adenoviruses, baculoviruses, parvoviruses, herpesviruses, 
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poxviruses, adeno-associated viruses, Semliki Forest viruses, vaccinia viruses, and 
retroviruses. 



17. The expression vector of claim 10 wherein said nucleic acid molecule is 
5 operably connected to a promoter selected from the group consisting of simian virus 
40, mouse mammary tumor virus, long terminal repeat of human immunodeficiency 
virus, maloney virus, cytomegalovirus immediate early promoter, Epstein Barr virus, 
rous sarcoma virus, human actin, human myosin, human hemoglobin, human muscle 
creatine, and human metalothionein. 

10 

1 8. A host cell transformed with an expression vector of claim 10. 



19. The transformed host cell of claim 18 wherein said cell is a bacterial cell. 



15 20. The transformed host cell of claim 19 wherein said bacterial cell is E. coli. 



20 



21. The transformed host cell of claim 18 wherein said cell is yeast. 

22. The transformed host cell of claim 21 wherein said yeast is S. cerevisiae. 

23. The transformed host cell of claim 18 wherein said cell is an insect cell. 



24. The transformed host cell of claim 23 wherein said insect cell is S. frugiperda. 
25 25. The transformed host cell of claim 18 wherein said cell is a mammalian cell. 



26. The transformed host cell of claim 25 wherein mammalian cell is selected 
from the group consisting of Chinese hamster ovary cells, HeLa cells, African green 
monkey kidney cells, human 293 cells, and murine 3T3 fibroblasts. 

27. An isolated nucleic acid molecule comprising a nucleotide sequence 
complementary to at least a portion of a sequence selected from the group of odd 
numbered sequences consisting of SEQ ID NO: 1 to SEQ ID NO:93 and SEQ ID 
NO: 185, said portion comprising at least 10 nucleotides. 
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28. The nucleic acid molecule of claim 27 wherein said molecule is an antisense 
oligonucleotide directed to a region of a sequence selected from the group of odd 
numbered sequences consisting of SEQ ED NO:l to SEQ ID NO:93 and SEQ ED NO: 

5 185. 

29. The nucleic acid molecule of claim 28 wherein said oligonucleotide is directed 
to a regulatory region of a sequence selected from the group of odd numbered 
sequences consisting of SEQ ED NO:l to SEQ ED NO:93 and SEQ ID NO: 185. 

10 

30. The nucleic acid molecule of claim 27 wherein said molecule is an antisense 
oligonucleotide directed to a region of nucleotide sequence selected from the group 
consisting of: SEQ ID NO: 73, SEQ ID NO:77, SEQ ID NO:79, SEQ ID NO:81 SEQ 
ID NO:83, SEQ ID NO:85, SEQ ID NO:89, SEQ ID NO:93 and SEQ ID NO: 185. 

15 

31. A composition comprising a nucleic acid molecule of any one of claims 1 to 5 
or 27 and an acceptable carrier or diluent. 

32. A composition comprising a recombinant expression vector of claim 10 and an 
20 acceptable carrier or diluent. 

33. A method of producing a polypeptide that comprises a sequence selected from 
the group of even numbered sequences consisting SEQ ID NO: 2 to SEQ ED NO: 94 
and SEQ ID NO: 186, and homologs and fragments thereof, said method composing 

25 the steps of: 

a) introducing a recombinant expression vector of claim 10 into a 
compatible host cell; 

b) growing said host cell under conditions for expression of said 
polypeptide; and 

30 c) recovering said polypeptide. 

34. The method of claim 33 wherein said host cell is lysed and said polypeptide is 
recovered from the lysate of said host cell. 
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25 



35. The method of claim 33 wherein said polypeptide is recovered by purifying 
the culture medium without lysing said host cell. 

36. An isolated polypeptide encoded by a nucleic acid molecule of claim 1 . 

37. The polypeptide of claim 36 wherein said polypeptide comprises a sequence 
selected from the group of even numbered sequences consisting SEQ ID NO:2 to 
SEQ ID NO:94 and SEQ ID NO: 1 86. 

38. The polypeptide of claim 36 wherein said polypeptide comprises an amino 
acid sequence homologous to a sequence selected from the group of even numbered 
sequences consisting of SEQ ID NO:2 to SEQ ID NO:94 and SEQ ID NO: 186. 

39. The polypeptide of claim 36 wherein said sequence homologous to a sequence 
selected from the group of even numbered sequences consisting of SEQ ID NO:2 to 
SEQ ID NO:94 and SEQ ID NO: 186 comprises at least one conservative amino acid 
substitution compared to the even numbered sequences in the group of even numbered 
sequences consisting of SEQ ID NO: 2 to SEQ ID NO: 94 and SEQ ID NO: 186. 

40. The polypeptide of claim 36 wherein said polypeptide comprises a fragment of 
a polypeptide with a sequence selected from the group of even numbered sequences 
consisting of SEQ ID NO:2 to SEQ ID NO:94 and SEQ ID NO: 186. 

41. The polypeptide of claim 36 wherein said polypeptide comprises an amino 
acid sequence selected from the group consisting of: SEQ ID NO: 2, SEQ ID NO: 74; 
SEQ ID NO: 4, SEQ ID NO: 186; SEQ ID NO: 10, SEQ ID NO:78; SEQ ID NO: 12, 
SEQ ID NO:80; SEQ ID NO: 22, SEQ ID NO:82; SEQ ID NO:54, SEQ ID NO:84; 
SEQ ID NO:60, SEQ ID NO: 86; SEQ ID NO:64, SEQ ID NO: 88, SEQ ID NO:90; 
SEQ ID NO:68, SEQ ID NO: 92, and SEQ ID NO:94. 

42. The polypeptide of claim 36 wherein said polypeptide comprises an amino 
acid sequence selected from the group consisting of: SEQ ID NO: 74; SEQ ID NO: 

1 86; SEQ ID NO:78; SEQ ID NO:S0; SEQ ID NO:82; SEQ ID NO:84; SEQ ID NO: 
86; SEQ ID NO:90; and SEQ ID NO:94. 
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43. A composition comprising a polypeptide of claim 36 and an acceptable carrier 
or diluent. 

5 44. An isolated antibody which binds to an epitope on a polypeptide of claim 36. 

45. The antibody of claim 44 wherein said antibody is a monoclonal antibody. 

46. A composition comprising an antibody of claim 44 and an acceptable carrier 
10 or diluent. 

47. A method of inducing an immune response in a mammal against a polypeptide 
of claim 36 comprising administering to said mammal an amount of said polypeptide 
sufficient to induce said immune response. 

15 

48. A method for identifying a compound which binds nGPCR-x comprising the 
steps of: 

a) contacting nGPCR-x with a compound; and 

b) determining whether said compound binds nGPCR-x. 

20 

49. The method of claim 48 wherein the nGPCR-x comprises an amino acid 
sequence selected from the group consisting of: SEQ ID NO: 2, SEQ ID NO: 74; SEQ 
ID NO: 4, SEQ ID NO: 1 86; SEQ ID NO: 10, SEQ ID NO:78; SEQ ID NO: 12, SEQ 
ID NO:80; SEQ ID NO: 22, SEQ ID NO:82; SEQ ID NO:54, SEQ ID NO:84; SEQ 

25 ID NO:60, SEQ ID NO: 86; SEQ ID NO:64, SEQ ID NO: 88, SEQ ID NO:90; SEQ 
ID NO:68, SEQ ID NO: 92, and SEQ ID NO:94. 

50. The method of claim 48 wherein the nGPCR-x comprises an amino acid 
sequence selected from the group consisting of: SEQ ID NO: 74; SEQ ID NO: 186; 

30 SEQ ID NO:78; SEQ ID NO:80; SEQ ID NO:82; SEQ ID NO:84; SEQ ID NO: 86; 
SEQ ID NO:90; and SEQ ID NO:94. 

5 1 . The method of claim 48 wherein binding of said compound to nGPCR-x is 
determined by a protein binding assay. 
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52. The method of claim 48 wherein said protein binding assay is selected from 
the group consisting of a gel-shift assay, Western blot, radiolabeled competition 
assay, phage-based expression cloning, co-fractionation by chromatography, co- 

5 precipitation, cross linking, interaction trap/two-hybrid analysis, southwestern 
analysis, and EL1SA. 

53. A compound identified by the method of claim 48. 

10 54. A method for identifying a compound which binds a nucleic acid molecule 
encoding nGPCR-x comprising the steps of: 

a) contacting said nucleic acid molecule encoding nGPCR-x with 
a compound; and 

b) determining whether said compound binds said nucleic acid 

15 molecule. 

55. The method of claim 54 wherein binding is determined by a gel-shift assay. 

56. A compound identified by the method of claim 54. 

20 

57. A method for identifying a compound which modulates the activity of 
nGPCR-x comprising the steps of: 

a) contacting nGPCR-x with a compound; and 

b) determining whether nGPCR-x activity has been modulated. 

25 

58. The method of claim 57 wherein the nGPCR-x comprises an amino acid 
sequence selected from the group consisting of: SEQ ID NO: 2, SEQ ID NO: 74; SEQ 
ED NO: 4, SEQ ID NO: 186; SEQ ID NO: 10, SEQ ID NO:78; SEQ ID NO: 12, SEQ 
ID NO:80; SEQ ID NO: 22, SEQ ID NO:82; SEQ ID NO:54, SEQ ID NO:84; SEQ 

30 ID NO:60, SEQ ID NO: 86; SEQ ID NO:64, SEQ ID NO: 88, SEQ ID NO:90; SEQ 
ID NO:68, SEQ ID NO: 92, and SEQ ID NO:94. 

59. The method of claim 57 wherein the nGPCR-x comprises an amino acid 
sequence selected from the group consisting of: SEQ ID NO: 74; SEQ ID NO: 186; 
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SEQ ID NO:78; SEQ ID NO:80; SEQ ID NO:82; SEQ ID NO:84; SEQ ID NO: 86; 
SEQ ID NO:90; and SEQ ID NO:94. 

60. The method of claim 57 wherein said activity is neuropeptide binding. 

61. The method of claim 57 wherein said activity is neuropeptide signaling. 

62. A compound identified by the method of claim 57. 

63. A method of identifying an animal homolog of nGPCR-x comprising the 
steps: 

a) comparing the nucleic acid sequences of the animal with a 
sequence selected from the group of odd numbered sequence consisting of SEQ ID 
NO: 1 to SEQ ID NO: 93, SEQ ED NO: 185, and portions thereof, said portions being 
at least 10 nucleotides; and 

b) identifying nucleic acid sequences of the animal that are 
homologous to said sequence selected from the group of odd numbered sequence 
consisting of SEQ ID NO: 1 to SEQ ID NO: 93, SEQ TD NO: 1 85, and portions 
thereof. 

64. The method of claim 63 wherein comparing the nucleic acid sequences of the 
animal with a sequence selected from the group of odd numbered sequence consisting 
of SEQ ID NO: 1 to SEQ ID NO: 93, SEQ ID NO: 185, and portions thereof, said 
portions being at least 10 nucleotides is performed by DNA hybridization. 

65. The method of claim 63 wherein comparing the nucleic acid sequences of the 
animal with a sequence selected from the group of odd numbered sequence consisting 
of SEQ ID NO: 1 to SEQ ID NO: 93, SEQ ID NO: 185, and portions thereof, said 
portions being at least 10 nucleotides is performed by computer homology search. 

66. A method of screening a human subject to diagnose a disorder affecting the 
brain or genetic predisposition therefor, comprising the steps of: 

(a) assaying nucleic acid of a human subject to determine a presence or 
an absence of a mutation altering an amino acid sequence, expression, or biological 
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activity of at least one nGPCR that is expressed in the brain, wherein the nGPCR 
comprises an amino acid sequence selected from the group consisting of: SEQ ID 
NO:74, SEQ ID NO; 186, SEQ ID NO:78, SEQ ID NO:80, SEQ ID NO:82, SEQ ID 
NO:84, SEQ ED NO:86, SEQ ID NO:90, and SEQ ID NO:94, and allelic variants 
5 thereof, and wherein the nucleic acid corresponds to a gene encoding the nGPCR; and 
(b) diagnosing the disorder or predisposition from the presence or 
absence of said mutation, wherein the presence of a mutation altering the amino acid 
sequence, expression, or biological activity of the nGPCR in the nucleic acid 
correlates with an increased risk of developing the disorder. 

10 

67. A method according to claim 66, wherein the nGPCR is nGPCR-40 
comprising an amino acid sequence set forth in SEQ ID NO:84 or an allelic variant 
thereof 

15 68. A method according to claim 66, wherein the nGPCR is nGPCR-54 

comprising an amino acid sequence set forth in SEQ ID NO:86 or an allelic variant 
thereof 

69. A method according to claim 66, wherein the disease is schizophrenia. 

20 

70. A method according to claim 66, wherein the assaying step comprises at least 
one procedure selected from the group consisting of: 

a) comparing nucleotide sequences from the human subject and 
reference sequences and determining a difference of either 
25 at least a nucleotide of at least one codon 

between the nucleotide sequences from the human subject that encodes an nGPCR-40 
allele and an nGPCR-40 reference sequence, or 

at least a nucleotide of at least one codon 
between the nucleotide sequences from the human subject that encodes an nGPCR-54 
30 allele and an nGPCR-54 reference sequence; 

(b) performing a hybridization assay to determine whether nucleic 
acid from the human subject has a nucleotide sequence identical to or different from 
one or more reference sequences; 
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(c) performing a polynucleotide migration assay to determine 
whether nucleic acid from the human subject has a nucleotide sequence identical to or 
different from one or more reference sequences; and 

(d) performing a restriction endonuclease digestion to determine 
whether nucleic acid from the human subject has a nucleotide sequence identical to or 
different from one or more reference sequences. 

71 . A method according to claim 70 wherein the assaying step comprises: 
performing a polymerase chain reaction assay to amplify nucleic acid comprising 
nGPCR-40 or nGPCR-54 coding sequence, and determining nucleotide sequence of 
the amplified nucleic acid. 

72. A method of screening for an nGPCR-40 or nGPCR-54 hereditary 
schizophrenia genotype in a human patient, comprising the steps of: 

(a) providing a biological sample comprising nucleic acid 
from said patient, said nucleic acid including sequences corresponding to allelles of 
nGPCR-40 or nGPCR-54; and 

(b) detecting the presence of one or more mutations in the 
nGPCR-40 allelle or the nGPCR-54 allelle; 

wherein the presence of a mutation in an nGPCR-40 allelle or nGPCR-54 
allele is indicative of a hereditary schizophrenia genotype. 

73. The method according to claim 72 wherein said biological sample is a cell 
sample. 

74. The method according to claim 72 wherein said detecting the presence of a 
mutation comprises sequencing at least a portion of said nucleic acid, said portion 
comprising at least one codon of said nGPCR-40 or nGPCR-54 alleles. 

75. The method according to claim 72 wherein said nucleic acid is DNA. 

76. The method according to claim 72 wherein said nucleic acid is RNA. 
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77. A kit for screening a human subject to diagnose schizophrenia or a genetic 
predisposition therefor, comprising, in association: 

(a) an oligonucleotide useful as a probe for identifying 
polymorphisms in a human nGPCR-40 gene or a human nGPCR-54 gene, the 
5 oligonucleotide comprising 6-50 nucleotides in a sequence that is identical or 

complementary to a sequence of a wild type human nGPCR-40 or nGPCR-54 gene 
sequence or nGPCR-40 or nGPCR-54 coding sequence, except for one sequence 
difference selected from the group consisting of a nucleotide addition, a nucleotide 
deletion, or nucleotide substitution; and 
10 (b) a media packaged with the oligonucleotide, said media 

containing information for identifying polymorphisms that correlate with 
schizophrenia or a genetic predisposition therefor, the polymophisrns being 
identifiable using the oligonucleotide as a probe. 

15 78. A method of identifying a nGPCR allelic variant that correlates with a mental 
disorder, comprising steps of: 

(a) providing a biological sample comprising nucleic acid 
from a human patient diagnosed with a mental disorder, or from the patient's genetic 
progenitors or progeny; 

20 (b) detecting in the nucleic acid the presence of one or more 

mutations in an nGPCR that is expressed in the brain, wherein the nGPCR comprises 
an amino acid sequence selected from the group consisting of SEQ ID NO:74, SEQ 
ID NO: 186, SEQ ID NO:78, SEQ ID NO:80, SEQ ID NO:82, SEQ ID NO:84, SEQ 
ID NO:86, SEQ ID NO:90, and SEQ ID NO:94, and allelic variants thereof, and 

25 wherein the nucleic acid includes sequence corresponding to the gene or genes 
encoding nGPCR; 

wherein the one or more mutations detected indicates an allelic variant that 
correlates with a mental disorder. 

30 79. A method according to claim 78, wherein the disorder is schizophrenia, and 
wherein the at least one nGPCR is nGPCR-40, nGPCR-54, or an allelic variant 
thereof. 
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80. A purified and isolated polynucleotide comprising a nucleotide sequence 
encoding an nGPCR-40 or nGPCR-54 allelic variant identified according to claim 79. 

81. A host cell transformed or transfected with a polynucleotide according to 
claim 80 or with a vector comprising the polynucleotide. 

82. A purified polynucleotide comprising a nucleotide sequence encoding 
nGPCR-40 or nGPCR-54 of a human with schizophrenia; 

wherein said polynucleotide hybridizes to the complement of SEQ ID 
NO:83 or of SEQ ID NO:85 under the following hybridization conditions: 

(a) hybridization for 16 hours at 42 °C in a hybridization solution 
comprising 50% formamide, 1% SDS, 1 M NaCl, 10% dextran sulfate and 

(b) washing 2 times for 30 minutes at 60°C in a wash solution 
comprising O.lx SSC and 1% SDS; and 

wherein the polynucleotide that encodes nGPCR-40 or nGPCR-54 amino acid 
sequence of the human differs from SEQ ID NO:84 or SEQ ID NO:86 by at least one 
residue. 

S3. A vector comprising a polynucleotide according to claim 82. 

84. A host cell that has been transformed or transfected with a polynucleotide 
according to claim 82 and that expresses the nGPCR-40 or nGPCR-54 protein 
encoded by the polynucleotide. 

85. A host cell according to claim 84 that has been co-transfected with a 
polynucleotide encoding the nGPCR-40 or nGPCR-54 amino acid sequence set forth 
in SEQ ED NO:84 or SEQ TD NO:86 and that expresses the nGPCR-40 or nGPCR-54 
protein having the amino acid sequence set forth in SEQ ID NO:84 or SEQ ID NO:86. 

86. A method for identifying a modulator of biological activity of nGPCR-40 or 
nGPCR-54 comprising the steps of: 

a) contacting a cell according to claim 84 in the presence 
and in the absence of a putative modulator compound; 
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b) measuring nGPCR-40 or nGPCR-54 biological activity 

in the cell; 

wherein decreased or increased nGPCR-40 or nGPCR-54 biological activity in 
the presence versus absence of the putative modulator is indicative of a modulator of 
5 biological activity. 

87. A method to identify compounds useful for the treatment of schizophrenia, 
said method comprising steps of: 

(a) contacting a composition comprising nGPCR-40 with a 
10 compound suspected of binding nGPCR-40 or contacting a composition comprising 

nGPCR-54 with a compound suspected of binding nGPCR-54; 

(b) detecting binding between nGPCR-40 and the compound 
suspected of binding nGPCR-40 or between nGPCR-54 and the compound suspected 
of binding nGPCR-54; 

15 wherein compounds identified as binding nGPCR-40 or nGPCR-54 are 

candidate compounds useful for the treatment of schizophrenia. 

88. A method for identifying a compound useful as a modulator of binding 
between nGPCR-40 and a binding partner of nGPCR-40 or between nGPCR-54 and a 

20 binding partner of nGPCR-54 comprising the steps of: 

(a) contacting the binding partner and a composition 
comprising nGPCR-40 or nGPCR-54 in the presence and in the absence of a putative 
modulator compound; 

(b) detecting binding between the binding partner and 
25 nGPCR-40 or nGPCR-54; 

wherein decreased or increased binding between the binding partner 
and nGPCR-40 or nGPCR-54 in the presence of the putative modulator, as compared 
to binding in the absence of the putative modulator is indicative a modulator 
compound useful for the treatment of schizophrenia. 

30 

89. A method according to claim 87 or 88 wherein the composition comprises a 
cell expressing nGPCR-40 or nGPCR-54 on its surface. 
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90. An method according to claim 89 wherein the composition comprises a cell 
transformed or transfected with a polynucleotide that encodes nGPCR-40 or nGPCR- 
54. 



5 91 . A method of purifying a G protein from a sample containing said G protein 
comprising the steps of: 

a) contacting said sample with a polypeptide of claim 1 for a time 
sufficient to allow said G protein to form a complex with said polypeptide; 

b) isolating said complex from remaining components of said 

10 sample; 

c) maintaining said complex under conditions which result in 
dissociation of said G protein from said polypeptide; and 

d) isolating said G protein from said polypeptide. 

15 92. The method of claim 91 wherein said sample comprises an amino acid 

sequence selected from the group of even numbered sequences consisting of SEQ ID 
NO:2 to SEQ ID NO:94 and SEQ ED NO: 186. 

93. The method of claim 91 wherein said polypeptide comprises an amino acid 
20 sequence homologous to a sequence selected from the group of even numbered 

sequences consisting of SEQ ID NO:2 to SEQ ID NO:94 and SEQ ID NO: 1 86. 

94. The method of claim 91 wherein said polypeptide comprises an amino acid 
sequence selected from the group consisting of: SEQ ID NO: 2, SEQ ID NO: 74; SEQ 

25 ID NO: 4, SEQ ID NO: 186; SEQ ID NO:10, SEQ ID NO:78; SEQ ID NO: 12, SEQ 
ID NO:80; SEQ ID NO: 22, SEQ ID NO:82; SEQ ID NO:54, SEQ ID NO:84; SEQ 
ID NO:60, SEQ ID NO: 86; SEQ ID NO:64, SEQ ID NO: 88, SEQ ID NO:90; SEQ 
ID NO:68, SEQ ID NO: 92, and SEQ ID NO:94. 

30 95. The method of claim 91 wherein said polypeptide comprises an amino acid 
sequence selected from the group consisting of: SEQ ID NO: 74; SEQ ID NO: 186; 
SEQ ID NO:7S; SEQ ID NO:80; SEQ ID NO:82; SEQ ID NO:84; SEQ ID NO: 86; 
SEQ ED NO:90; and SEQ ID NO:94. 
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96. An isolated nucleic acid molecule comprising a nucleotide sequence that 
encodes a polypeptide comprising an amino acid sequence homologous to SEQ ID 
NO:76, and fragments thereof; said nucleic acid molecule encoding at least a portion 
ofnGPCR-5. 

97. An isolated polypeptide encoded by a nucleic acid molecule of claim 96. 



0136473A2 



166 





WO 01/3M73 



PCT/USOH/31581 



SEQUENCE LISTING 



<11C> Pharmacia & Upjohn Company 
Vogeli, Gabriel 
Huff, Rita 
Sejlitz, Torsten 
Lind, Peter 
Slightom, Jerry 
Schellm, Kathleen 
Bannigan, Chris 
Ruff, Valerie 
Kaytes, Paul 
Wood, Linda 
Parodi, Luis 
Hiebsch, Ronald 

<120> Novel G Protein Coupled Receptors 

<130> 043P1FHRM296 

<150> 60/165,838 
<151> 1999-11-16 

<150^ 60/198,568 
<151> 2000-04-20 

■;150> 60/166,071 
■:151 > 1999-11-17 

•■ISO.- 60/166, 676 
•-'151 • 1999-1 1-1 9 

•:150 - 60/173, 396 
*-151> 199 9-12-." 8 

■■.150 - 60/18 4, 129 
<151 - 2000-02-22 

-150^ 60/185,4.;! 
•'.151 * 2000-02-28 

• 150 - 60/18 5, 554 
■151.- 2000-02-.'8 

-150- 60/18 6,5 30 
■ 151> 2000-03-02 

•150:- 60/186, 81 1 
•'151:- 2000-03-03 

- 150:- 60/188, 11 4 

• 151:- 2000-03-09 

• 1 50> 60/190, 310 
-151: 2000-03-17 

' 150> 60/190, 800 
' 151 > 2 00 0-03-21 



Page 1 



; ~v.G Z -:WO C'3t J7jA, 




WO 01/36473 PCT/IJS00/31581 



<150> 


60/201, 190 


•:151> 


2000-05-02 


«;150> 


60/203, 111 


<15I> 


2000-05-08 


1 3 U > 




1 5 1 > 




•:160 > 


190 


•■ 1 7 0 > 


Pat&nt In version 3.0 


O10 > 


1 


- 2 1 1 > 


1182 


- 212> 


DNA 


-213> 


H . Sapiens 


<4 00:> 


1 



qtctgggggt 


ggggga t get 


gggacagggg 


t caattgect 


gaagcaagtg 


ctctcat ccc 


60 


cct agctcct 


gctgatct ag 


t tggggc t cc 


agagtgggga 


ggagaaaggc 


act tt gaaac 


120 


1 1 ctctgccc 


t t accgt ctt 


ageca t caaa 


ctctgagctg 


gagat agtga 


cgatgt gaca 


180 


ggaactttcc 


ct gggect ct 


ctgggccaca 


a ttcctggcc 


gagagaaaga 


ggagga atga 


24 0 


agtgagcacc 


t t ctt cactc 


etagggeca t 


g tggtagagc 


tgcag t cgea 


c :tccttctg 


300 


ccaataggca 


t agat gagtg 


gg ttgagcag 


ggagtt gece 


acgccgagca 


gecacaggta 


360 


ccgttccagc 


actaggtaga 


ggtgacactc 


ctggcaggcc 


acct gcacaa 


t gecag t gat 


4 2 0 


aaggaagggg 


gt ccaggata 


gagcaaagct 


cccaa t gaga 


acagacacag 


taeggagage 


4SQ 


r ttgaagtcg 


ctgggagtcc 


gtggggatcg 


a t aacct cca 


gecat :jgct c 


ct gca tgttc 


5-10 


ca tct t t cga 


a tct get ggc 


t g tgea tgga 


ggcaat cttg 


ageatgtege 


agtagaagaa 


60 0 


gacaaagagg 


agca t ggctg 


ggaagaagee 


aacgeaggag 


agggt cagca 


cgaagt gagg 


660 


a tgaaa t aca 


gcaaagaagc 


t gcactgccc 


t t tgt aggca 


gt ctgctgga 


acat ggggat 


720 


t ccgagtggg 


aggaagecaa 


t gaggt aaga 


cactaaccac 


agcccggcaa 


t gcaggcccc 


780 


ggccacgaac 


ecac t catga 


t ctt caagta 


geggaaggge 


tget tgatgg 


caaggt acct 


840 


gtcaaaggtg 


a tcagca tga 


ccgtgaggac 


agaggcagct 


gcggaggaag 


tgacaaa tgc 


900 


cat ccgcagg 


:tgcacaggg 


tcttctgtgt 


gggccgagaa 


gggctggaga 


gctggt ct gt 


960 


gagt aggcca 


gagatggeca 


caccaa t caa 


ggtgtcagcc 


acagecagat 


t caaggt gaa 


1020 


qcagagactg 


acacca teat 


tct tgtggat 


caacagcagc 


acagccacag 


ccactagtgt 


1080 


gtt agtagca 


a tgat gaggg 


aggecaggae 


agcaaggatc 


act ccaaatg 


agaaagat ga 


1140 


1 1 ccatgtc t 


cgaag tggca 


ggactt cact 


t accagggca 


tg 




1182 
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< 2 1 0 > 2 

<211> 335 

<212> PRT 

<213> H. Sapiens 

<h00> 2 

Met Glu Ser Ser Phe Ser Phe Gly Val He Leu Ala Val Leu Ala Ser 
I 5 1C 15 

Leu He He Ala Thr Asn Thr Leu Val Ala Val Ala Val Leu Leu Leu 
20 25 30 

He His Lys Asn Asp Gly Val Ser Leu Cys Phe Thr Leu Asn Leu Ala 
35 40 45 

Val Ala Asp Thr Leu He Gly Val Ala lie Ser Gly Leu Leu Thr Asp 
50 55 60 

Gin Leu Ser Ser Pro Ser Arg Pro Thr Gin Lys Thr Leu Cys Ser Leu 
65 70 "?5 80 

Arq Met Ala Phe Val Thr Ser Ser Ala Ala Ala Ser Val Leu Thr Val 
85 90 95 

Met Leu He Thr Phe Asp Arg Tyr Leu Ala He Lys Gin Pro Phe Arg 
100 105 HO 

Tvr Leu Lys He Met Ser Gly Phe Val Ala Gly Ala Cys He Ala Gly 
115 120 125 

Leu Trp Leu Val Ser Tyr Leu He Gly Phe Leu Pro Leu Gly He Pro 
130 135 140 

Met Phe Gin Gin Thr Ala Tyr Lys Gly Gin Cys Ser- Phe Phe Ala Val 
145 150 155 160 

Phe His Pro His Phe Val Leu Thr Leu Ser Cys Val Gly Phe Phe Pro 
165 170 175 

Ala Met Leu Leu Phe Val Phe Phe Tyr Cys Asp Met Leu Lys He Ala 
180 185 190 

Ser Met His Ser Gin Gin He Arg Lys Met Glu His Ala Gly Ala Met 
195 200 205 

Ala Gly Gly Tyr Arg Ser Pro Arg Thr Pro Ser Asp Phe Lys Ala Leu 
210 215 220 

Arg Thr Val Ser Val Leu He Gly Ser Phe Ala Leu Ser Trp Thr Pro 
225 230 235 240 

Phe Leu He Thr Gly He Val Gin Val Ala Cys Gin Glu Cys His Leu 
245 250 255 

^vr Leu Val Leu Glu Arg Tyr Leu Trp Leu Leu Gly Val Gly Asn Ser 
* y ^™ t« 27D 



Page 3 



WO 01/36473 PCT/USOO/31581 

Leu Leu Asn Pro Leu lie Tyr Ala Tyr Trp Gin Lys Glu Val Arg Leu 
275 280 285 

Gin Leu Tyr His Met Ala Leu Gly Val Lys Lys Val Leu Thr Ser Phe 
290 295 300 

Leu Leu Phe Leu Ser Ala Arg Asn Cys Gly Pro Glu Arg Pro Arg Glu 
305 310 315 320 

Ser Ser Cys His lie Val Thr lie Ser Ser Ser Glu Phe Asp Gly 
325 330 335 

<210> 3 
<211> 657 
•:212> DNA 
<213> H . Sapiens 

•:<3 00> 3 



cagcgcgagc 


gecttcatgg 


tgacggtgt c 


ca tgcgctgg 


cagt gtctgc 


g tgccacccg 


60 


gtgcacctgg 


agcgaggtga 


ggcagagcac 


cgccagcggc 


agcacgaagc 


ccacggcatg 


120 


gugcgtggcg 


gtgaaggctg 


egaagegegg 


acgctcaggc 


t egggeggea 


ggegcagega 


180 


acaggacgcg 


aaggcgctgc 


tgt agecaag 


ccacgagcag 


ccaagtgcag 


cgcctgagaa 


240 


ggccagcgac 


tgt ccccagg 


cacagcccag 


cagcaggccg 


gcat agegeg 


gt cgcaggcg 


300 


t ccggcgt ag 


cgcagtggga 


agcccact gc 


cagccact gg 


t ct geget ca 


gcgccgccac 


360 


get cagcgcc 


gcgt tggacg 


ccaggaaggt 


gtccaggaag 


ccaa tgact t 


ggca t gegee 


4 20 


gggcgccgac 


ggtgt ccgcc 


cgcgcatcac 


accgagcagc 


gtgaagggca 


tgtccagcgc 


480 


cgccagcagc 


aggtggccca 


gagacagatt 


caccaggagg 


aegectgagg 


ctcgagtgcg 


540 


gagct cagcg 


ct gt aggege 


aacaaagcag 


caccagtgcg 


t t ggatagca 


gcgccacggc 


600 


- :^gtaccat c 


accaggagac 


ccgccagcag 


cgect cgccg 


gggcccatgg 


egctage 


657 



<;-io> 4 

<211> 217 
-:212> PRT 
•:213> H. Sapiens 

<A00> 4 

Ser Ala Met Gly Pro Gly Glu Ala Leu Leu Ala Gly Leu Leu Val Met 
15 10 15 

Val Leu Ala Val Ala Leu Leu Ser Asn Ala Leu Val Leu Leu Cys Cys 
20 25 30 

Ala Tyr Ser Ala Glu Leu Arg Thr Arg Ala Ser Gly Val Leu Leu Val 
35 40 45 

Asn Leu Ser Leu Gly His Leu Leu Leu Ala Ala Leu Asp Met Pro Phe 
50 55 60 
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Thr Leu Leu Gly Val Met Arg Gly Arg Thr Pro Ser Ala Pro Gly Ala 
6 5 7 0 7 5 8 0 

Cys Gin Val lie Gly Phe Leu Asp Thr Phe Leu Ala Ser Asn Ala Ala 
85 90 95 

Leu Ser Val Ala Ala Leu Ser Ala Asp Gin Trp Leu Ala Val Gly Phe 
100 105 110 

Fro Leu Arg Tyr Ala Gly Arg Leu Arg Pro Arg Tyr Ala Gly Leu Leu 
115 " 120 125 

Leu Giy Cys Ala Trp Gly Gin Ser Leu Ala Phe Ser Gly Ala Ala Leu 
130 135 140 

Gly Cys Ser Trp Leu Gly Tyr Ser Ser Ala Phe Ala Ser Cys Ser Leu 
145 150 155 160 

Arg Leu Pro Pro Glu Pro Glu Arg Pro Arg Phe Ala Ala Phe Thr Ala 
165 170 175 

Thr Leu His Ala Val Gly Phe Val Leu Pro Leu Ala Val Leu Cys Leu 
180 185 190 

Thr Ser Leu Gin Val His Arg Val Ala Arg Arg His Cys Gin Arg Met 

195 200 205 

Asp Thr Val Thr Met Lys Ala Leu Ala 
210 215 

^;:io> 5 

■ 211> 222 
212> DNA 

■ 213> H.Sapiens 

■ 400,- 5 

tqtgcaggtg tgatctccat tcctttgtac atccctcaca cgctgrtcga atgggatttt 60 

aaaaaggaaa tctgtgtatt ttggctcact actgactatc tgttatgtac agcatctgta 120 

Lataacattg tcctcatcag ctatgatcga tacctgtcag tctcaaatgc tgtaagtcga 180 

acacattaat ttatccccct tagaagatta tgtaaatgta ta 222 






6 

73 
PRT 

H . Sapiens 



•:400> 



6 



Cys Ala Gly Val lie Ser lie Pro Leu Tyr lie Pre Hls Thr Leu Phe 
15 10 15 



Glu Trp Asp Phe Gly Lys Glu lie Cys Val Phe Trp Leu Tnr Thr Asp 
20 25 30 
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Tyr Leu Leu Cys Thr Ala Ser Val Tyr Asn lie Val Leu lie Ser Tyr 
35 40 45 

Asp Arg Tyr Leu Ser Val Ser Asn Ala Val Ser Arg Thr His Phe lie 
50 55 60 

Pro Leu Arg Arg Leu Cys Lys Cys lie 
65 70 



<210> 7 

<211> 507 

< 2 1 2 > DNA 

<2 1 3> H . Sapiens 












<400 > 7 
gaegt cgaag 


caggt ga tga 


tqcccaqgqc 


gt geaceggg 


taggtgagat 




60 


cageggggae 


agggeggt ca 


ggagcagcag 


ccaggtccct 


gcacacgcgg 


ccaccgcg t a 


120 


aegaeggegg 


cgccagcgct 


tggagctgag 


egggtacagg 


a t ccccagga 


agcgctccac 


180 


gctgatacag 


gt cat ggtga 


ggat gctgga 


at acatgttt 


gcgt aaaagg 


ccacggtcac 


240 


cacgltgcaa 


agcagcaccc 


cgaataccca 


gtggtggcgg 


t tgcaatggt 


agt agattt g 


300 


^aaaggcaac 


acgctggcca 


gcat caggtc 


cgtgacgctc 


aggttga t ca 


tgaagatgac 


360 


jrgaeggggat 


ct gggcccca 


tgcgccggca 


cagcacccac 


agagagaaga 


gg t tgecegg 


420 


gatgetgace 


gccgccacca 


gcgagt acac 


caegggcagg 


gccaccgcga 


t cgccgggt t 


480 


ocgcagcatc 


tgeagegteg 


cgtt gt c 








507 



■ 210> 8 

• 211:- 169 
•212;- PRT 

• :2 1 3> H . Sapiens 

•-4 00:- 8 

Asp Asn Ala Thr Leu Gin Met Leu Arg Asn Pro Ala lie Ala Val Ala 
15 10 15 

Leu Pro Val Val Tyr Ser Leu Val Ala Ala Val Ser lie Pro Gly Asn 
20 25 30 

Leu Fhe Ser Leu Trp Val Leu Cys Arg Arg Met Gly Pro Arg Ser Pro 
35 40 45 

Ser Val lie Phe Met lie Asn Leu Ser Val Thr Asp Leu Met Leu Ala 
50 55 60 

Ser Val Leu Pro Phe Gin lie Tyr Tyr His Cys Asn Arg His His Trp 
^5 70 75 80 

Val Fhe Gly Val Leu Cys Asn Leu Val Val Thr Val Ala Phe Tyr Ala 
85 90 95 
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Asn Met Tyr Ser Ser lie Leu Thr Met Thr Cys He Ser Val Glu Arg 
100 105 HO 

Phe Leu Gly He Leu Tyr Pro Leu Ser Ser Lys Arg Trp Arc: Arg Arg 
115 120 125 

Arg Tyr Ala Val Ala Ala Cys Ala Gly Thr Tro Leu Leu Leu Leu Thr 
13C 135 * 140 

Ala Leu Ser Pro Leu Ala Arg Thr Asp Leu Thr Tyr Pro Val His Ala 
1-45 150 155 160 

Leu Gly He He Thr Cys Phe Asp Val 
165 

<210> 9 
<211> 270 
< 2 1 2 > DNA 
<213> H. Sapiens 

<400> 9 



cccatgt tec 


t gctcctggg 


cagect cacg 


ttgt eggate 


tgctggcagg 


cgccgcctac 


60 


gccgccaaca 


tcctact gt c 


ggggccgctc 


aegctgaaac 


t gt cccccgc 


gctct ggt tc 


120 


geaegggagg 


gaggegt ctt 


cgtggcactc 


actgegt ccg 


tgetgagect 


cct gggca tc 


180 


gcgot ggagc 


geagect cac 


ca tggegege 


agggggeccg 


cgcccgt ct c 


cagt eggggg 


240 


cqcacgct gg 


cga tggcagc 


cgcggcctgg 








270 



< 2 1 0 > 10 

<211> 90 

<212> PRT 

<213> H. Sapiens 

<-100> 10 

Pro Met Phe Leu Leu Leu Gly Ser Leu Thr Leu Ser Asp Leu Leu Ala 
: 5 10 15 

Gly Ala Ala Tyr Ala Ala Asn He Leu Leu Ser Gly Pro Leu Thr Leu 

20 25 30 

Lys Leu Ser Pro Ala Leu Trp Phe Ala Arg Glu Gly Gly Val Phe Val 
35 40 45 

Ala Leu Thr Ala Ser Val Leu Ser Leu Leu Gly He Ala Leu Glu Arg 
50 55 60 

Ser Leu Thr Met Ala Arg Arg Gly Pro Ala Pro Val Ser Ser Arg Gly 

= - ^0 7 5 8 0 

Arg Thr Leu Ala Met Ala Ala Ala Ala Trp 
85 90^ 

<110> 11 
< 2 1 1 > RRR 
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<212> DNA 
<213> H. Sapiens 

<400> 11 



ctgct catt g 


tggcct ttgt 


get gggcgca 


ctaggcaa t g 


gggtcgccct 


gtgtggtt tc 


60 


tgcttccaca 


tgaagacctg 


gaagcccagc 


actgttt ace 


ttttcaat tt 


ggccgtggct 


120 


gatttcctcc 


t tatgatctg 


cctgccttt t 


eggacagact 


at tacct cag 


aegt agacac 


180 


rgggct ttt g 


gggacat t cc 


ctgccgagtg 


gggctct tea 


cgttggccat 


gaacagggee 


240 


gggagcatcg 


tgt tccttac 


ggtggtggct 


gcggacaggt 


at ttcaaagt 


ggt ccacccc 


30 0 


caccacgcgg 


tgaacactat 


ct ccacccgg 


gtggcggctg 


geategtctg 


caccctgt gg 


360 


gccctggtca 


tcct gggaac 


agtgt atct t 


ttget ggaga 


accatctctg 


cgt gcaagag 


420 


acggccgtct 


cctgt gagag 


cttca teat g 


gagt eggeca 


atggctggca 


tgacatca tg 


4 80 


t tccagctgg 


agttct ttat 


gcccc tegge 


atcatct tat 


tttgetcett 


caagat tgtt 


5^0 


tggagcctga 


ggcggaggca 


gcagct ggee 


agacaggct c 


gga tgaagaa 


ggcgacccgg 


60 0 


*. r .catcatgq 


tggtggcaat 


tgt gt teat c 


acatget ace 


tgcccagcgt 


gt ctgetaga 


660 


■:*■ ctat ttcc 


tctggacgg t 


gccctcgagt 


gcctgcgatc 


cct ctgt cca 


t ggggccctg 


72 0 


■:acataaccc 


tcagct tcac 


ctacat gaac 


ageatget gg 


at cccct gg t 


gtattat ttt 


780 


-.caagcccct 


cctttcccaa 


a t tctacaac 


aagct caaaa 


t ct gcag t ct 


gaaacccaag 


840 


■ :agccaggac 


actcaaaaac 


acaaaggecg 


gaagagat gc 


caa tttcg 




888 



2 10 
211 
212 
213 



296 
PRT 

H . Sap Lens 



•:400:^ 12 

Leu Leu lie Va 1 Ala Phe Val Leu Gly Ala Leu Gly Asn Gly Val Ala 
15 10 15 

Leu Cys Gly Phe Cys Phe His Met Lys Thr Trp Lys Pro Ser Thr Val 

20 25 30 

Tyr Leu Phe Asn Leu Ala Val Ala Asp Phe Leu Leu Met lie Cys Leu 
35 40 45 

Pro Phe Arg Thr Asp Tyr Tyr Leu Arg Arg Arg His Trp Ala Phe Gly 
bO 55 60 

Asp lie Pro Cys Arg Val Gly Leu Phe Thr Leu Ala Met Asn Arg Ala 
65 70 75 80 

Gly Ser lie Val Phe Leu Thr Val Val Ala Ala Asp Arg Tyr Phe Lys 
85 90 95 
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Val Val H s s Fro His His Ala Val Asn Thr lie Ser Thr Arg Val Ala 
100 105 110 

Ala Gly lie Val Cys Thr Leu Trp Ala Leu Val He Leu Gly Thr Val 
115 120 125 

Tyr Leu Leu Leu Glu Asn His Leu Cys Val Gin Glu Thr Ala Val Ser 
130 135 140 

Cys Glu Ser Phe He Met Glu Ser Ala Asn Gly Trp His Asp He Met 
145 150 155 160 

Phe Gin Leu Glu Phe Phe Met Pro Leu Gly He He Leu Phe Cys Ser 
165 170 175 

Phe Lys He Val Trp Ser Leu Arg Arg Arg Gin Gin Leu Ala Arg Gin 
]80 185 190 

Ala Arg Met Lys Lys Ala Thr Arg Phe lie Met Val Val Ala He Val 
195 200 205 

Phe He Thr Cys Tyr Leu Pro Ser Val Ser Ala Arg Leu Tyr Phe Leu 
210 215 220 

T rp Thr Val Pro Ser Ser Ala Cys Asp Pro Ser Val His Gly Ala Leu 
225 230 235 240 

His He Thr Leu Ser Phe Thr Tyr Met Asn Ser Met Leu Asp Pro Leu 
245 250 255 

Val Tyr Tyr Phe Ser Ser Pro Ser Phe Pro Lys Pne Tyr Asn Lys Leu 
260 265 270 

Lvs Tie Cys Ser Leu Lys Pro Lys Gin Pro Gly His Ser Lys Thr Gin 
275 280 - 285 

Arg Pro Glu Glu Met Pro He Ser 
::90 295 



210: 
.211: 

■:212: 
: 2 1 3 : 



13 

510 

DNA 

H . Sapiens 



: '1 0 0 > 13 



tqgagctgtg 


ccaccaccta 


t ct ggtgaac 


ct ga tggt gg 


ccgacct get 


tt atgtgcta 


60 


t igccctt cc 


t catcatcac 


ctactcacta 


gat gacaggt 


ggcect tegg 


ggagct gctc 


120 


tgcaagctgg 


tgcacttcct 


gt t ct at at c 


aacc tttacg 


gcagcatcct 


gctgctgacc 


180 


•jjcatctctq 


tgcaccagtt 


cctaggt gt g 


t gccacccac 


t gtgtt cget 


gccctaccgg 


240 


a:ccgcaggc 


atgcctggct 


gggcaccagc 


accacct ggg 


ccctggtggt 


cctccagctg 


300 


ctgcccacac 


t ggccttctc 


ccacacggac 


t acat caatg 


gecagatga t 


ctgg ta t gac 


360 


atgac:agcc 


aagagaat 1 1 


tgat cggctt 


t tt gcct acg 


gcatagt tct 


gacat tgtct 


420 
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ggctttcttt ccctccttgg tcattttggt gtgctattca ctgatggtca ggagcctgat 480 
caagccagag gagaacctca tgaggacagg 510 

<210> 14 

<:n> 170 

<212> PRT 
<213> H . Sapiens 

<400> 14 

Trp Ser Cys Ala Thr Thr Tyr Leu Val Asn Leu Met Val Ala Asp Leu 
15 10 15 

Leu Tyr Val Leu Leu Pro Phe Leu lie lie Thr Tyr Ser Leu Asp Asp 
20 25 30 

Arg Trp Pro Phe Gly Glu Leu Leu Cys Lys Leu Val His Phe Leu Phe 
35 40 45 

Tyr lie Asn Leu Tyr Gly Ser lie Leu Leu Leu Thr Cys lie Ser Val 
50 55 60 

His Gin Phe Leu Gly Val Cys His Pro Leu Cys Ser Leu Pro Tyr Arg 
6 5 70 75 80 

Thr Arg Arg His Ala Trp Leu Gly Thr Ser Thr Thr Trp Ala Leu Val 
85 90 95 

Val Leu Gin Leu Leu Pro Thr Leu Ala Phe Ser His Thr Asp Tyr lie 
100 105 110 

Asn Gly Gin Met lie Trp Tyr Asp Met Thr Ser Gin Glu Asn Phe Asp 
115 120 - 125 

Arg Leu Phe Ala Tyr Gly lie Val Leu Thr Leu Ser Gly Phe Leu Ser 
130 135 140 

Leu Leu Gly His Phe Gly Val Leu Phe Thr Asp Gly Gin Glu Pro Asp 
-'15 150 15 5 160 

Gin Ala Arg Gly Glu Pro His Glu Asp Arg 
165 170 



2 10.- 


15 


2 11 > 


894 


2 12 - 


DNA 


2 13- 


H . Sapiens 


2 2 0 > 






misc feature 


2 2 2-* 


(4 31) .. (4 61) 


2 2 3> 


n is any nucleotide 


4 0 0> 


15 



ccaccacgcg cagcacgccg acagggcctc tccctcccat tctcccgcag gcccggacga 60 
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ccacgctgcc 


tccagccggt 


cggcaaact a 


gggcagctcg 


cagcccacga 


acagcagccc 


120 


cagcagctgg 


ctcat cttca 


ggctctgca z 


ctt ggcgcgg 


ggcat cgege 


tgggcgcacg 


180 


ggc tccacct 


ggg=tcgccg 


accaggccg z 


tg racccgct 


ggggect tea 


geeggt gecg 


2-10 


ccaccagacg 


gagagtaggt 


ggccacaag r 


ga ;acccatg 


atctt aacag 


gcgcgacgaa 


300 


gcccgcgacg 


gcct ca taga 


acgcgtacac 


ctgcacgtgc 


cagcgctgca 


ggagcgcgaa 


360 


ga tccagtgg 


cagcgacgca 


tccccggcca 


ggctcgggcg 


gagagtgg-g 


cgcctggct g 


4 20 


cagagacgt t 


nnnnnnnnnn 


nnnnnnnnnn 


nnnnnnnnnn 


nagtactagc 


gcaccacaaa 


480 


vcccgacccc 


cgcg:cagca 


g cagtgccag 


cagcc agccc 


agggeggega 


gggcacgcgc 


5 '10 


qggcagcgqc 


cggccgtgcg 


gaagacgcac 


eg rgegcegg 


cgctcgaggg 


cgatgagcac 


60 0 


cacgaggtgg 


gccgaggcgc 


cccgcccgga 


tgect gcagc 


agetgeagga 


agcggcacgc 


660 


caggtccccc 


gt ggccgcgc 


ggggct cgcc 


cagcagt t cc 


caggccagct 


gtgacagege 


720 


■;gtgcccccg 


cacgcgtaca 


ggtccgccag 


ggeca get gc 


accagcagga 


agtccatctt 


780 


■ icgacgcttn 


nnnnnnnnnn 


nnnnnnnnnn 


nnnnnnnnac 


aggeggcaca 


gcactgt ggt 


840 


■Mtt gcctgcc 


accg :cacca 


ccaggatgac 


ccccaqgaac 


accaggegga 


cgcg 


894 



210> 


16 


2 1 1 > 


2 96 


212> 


PRT 


213> 


H . Sapiens 


22 0> 




221:- 


UNSURE 




(26) . . (35) 


2 2.Hr* 


Xaa is unknown 


220> 




22 1> 


UNSURE 


■-> 'i -> 


( 144 ) . . (154 ) 


2 2 3: > 


Xaa is Unknown 


4 0 0> 


16 



Arg Val Arg Leu Val Phe Leu Gly Val lie Leu Val Val Ala Val Ala 

1 5 10 15 

r,ly Asn Thr Thr Val Leu Cys Arg Leu Xaa Xaa Xaa Xaa Xaa Xaa Xaa 

20 25 30 

Xaa Xaa Xaa Lys Arg Arg Lys Met Asp Phe Leu Leu Val Gin Leu Ala 
35 40 45 



Leu Ala Asp Leu Tyr Ala Cys Gly Gly Thr Ala Leu Ser Gin Leu Ala 
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50 55 60 

Trp Glu Leu Leu Gly Glu Pro Arg Ala Ala Thr Gly Asp Leu Ala Cys 

65 70 75 80 

Arg Phe Leu Gin Leu Leu Gin Ala Ser Gly Arg Gly Ala Ser Ala His 
85 90 95 

Leu Val Val Leu lie Ala Leu Glu Arg Arg Arg Ala Val Arg Leu Pro 
100 105 110 

His Gly Arg Pro Leu Pro Ala Arg Ala Leu Ala Ala Leu Gly Trp Leu 
115 120 125 

Leu Ala Leu Leu Leu Ala Arg Gly Ser Gly Phe Val Val Arg Tyr Xaa 
130 135 140 

Xrja Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Thr Ser Leu Gin Pro Gly 
145 150 155 160 

Ala Pro Leu Ser Ala Arg Ala Trp Pro Gly Met Arg Arg Cys His Trp 
165 170 175 

lie Phe Ala Leu Leu Gin Arg Trp His Val Gin Val Tyr Ala Phe Tyr 
180 185 L90 

GLu Ala Val Ala Gly Phe Val Ala Pro Val Lys He Met Gly Val Ala 
195 200 205 

Cys Gly His Leu Leu Ser Val Trp Trp Arg His Arg Leu Lys Ala Pro 
210 215 220 

Ala Gly Ala Ala Ala Trp Ser Ala Ser Pro Gly Gly Ala Arg Ala Pro 
225 230 235 240 

Ser Ala Met Pro Arg Ala Lys Val Gin Ser Leu Lys- Met Ser Gin Leu 
245 250 255 

L^u Gly Leu Leu Phe Val Gly Cys Glu Leu Pro Phe Ala Asp Arg Leu 
260 265 270 

Glu Ala Ala Trp Ser Ser Gly Pro Ala Gly Glu Trp Glu Gly Glu Ala 
275 280 285 

Lou Ser Ala Cys Cys Ala Trp Trp 
290 295 

:2 10> 17 

<211> 801 

•:212:> DNA 

<213> H. Sapiens 

<400> 17 

tctaagtttt tctctgaact ttgagcctgt gaaaaaagaa gggatgctgc ctcaggccac 60 

cccagcctag atactcactc tgagtgccat gaggtagtag aggacactga tgacagtcat 120 

ggggaggagg tagaatagga aggaggtgac ctggatgatg aaattgtaga tccacatggg 180 
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ctt gat gacc 


gtacaggtgg 


ccgaacctgg 


gaccagggac 


ccat tgggga 


agt agtggaa 


240 


ctt gat gcca 


tggatgctgg 


tgttgggcag 


ggagaagagc 


aeggagaage 


cccagacgat 


300 


gccgaggat c 


ctgagggccc 


ggcgccgggt 


getctgeagt 


t tggcgcgga 


acgggtgtag 


360 


gat ggccacg 


tagcgctcca 


cgctgacggt 


ggtgatgct g 


agga tggagg 


cgaagcacac 


420 


ggt ctcaaag 


agggcegtet 


tgaagtagca 


gcccacgggc 


ccgaacaaga 


aagggtagtt 


480 


gcgccacat c 


tcatagacct 


ccaggggcat 


tccaaggagc 


aggaccagga 


ggt cagagac 


540 


cgccaggct g 


aagaggt agt 


agttggtggg 


egtcttea ta 


gcctggtgct 


gcagaat cac 


600 


caggcacacc 


aggacat tgc 


caatgacccc 


caccacaaaa 


attggcacat 


acaccacaga 


660 


cacggggagg 


aagaagt ggc 


tgegecgagg 


teegcagagg 


aaggecagat 


actcctcggt 


720 


get gtt cagg 


tgt ttctgga 


atggatctt c 


tagt ttctgc 


tggtagatcc 


aggaagcat t 


780 


ctgaagt ttt 


tccat ccctg 


a 








801 



<210> 18 

<211> 24 9 

<;~:i2> PRT 

-::!13> H. Sapiens 

-:400> 18 

Ser Gly Met Glu Lys Leu Gin Asn Ala Ser Trp lie Tyr Gin Gin Lys 
15 10 15 

L,eu Glu Asp Pro Phe Gin Lys His Leu Asn Ser Thr Glu Glu Tyr Leu 
20 25 30 

Ala Phe Leu Cys Gly Pro Arg Arg Ser His Phe Phe Leu Pro Val Ser 
35 40 45 

Val Val Tyr Val Pro He Phe Val Val Gly Val He Gly Asn Val Leu 

; >0 55 60 

Val Cys Leu Val He Leu Gin His Gin Ala Met Lys Thr Pro Asn Thr 
65 70 75 80 

Tvr Tyr Leu Phe Ser Leu Ala Val Ser Asp Leu Leu Val Leu Leu Leu 
85 90 95 

Gly r-'et Pro Leu Glu Val Tyr Glu Met Trp Arg Asn Tyr Pro Phe Leu 
100 105 HO 

Phe Gly Pro Val Gly Cys Tyr Phe Lys Thr Ala Leu Phe Glu Thr Val 
115 120 125 

Cys Phe Ala Ser He Leu Ser He Thr Thr Val Ser Val Glu Arg Tyr 
130 135 140 

Val A 1 a lie Leu His Pro Phe Arg Ala Lys Leu Gin Ser Thr Arg Arg 
145 150 155 lbO 
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Ara Ala Leu Arg He Leu Gly He Val Trp Gly Phe Ser Val Leu Phe 
165 ' 170 175 

,;er Leu Pro Asn Thr Ser He His Gly He Lys Phe His Tyr Phe Pro 
160 185 190 

Asn '31 y Ser Leu Val Pro Gly Ser Ala Thr Cys Thr Val He Lys Pro 

195 200 205 

Met Trp He Tyr Asn Phe He He Gin Val Thr Ser Phe Leu Phe Tyr 
2 10 215 22 0 

Leu Leu Pro Met Thr Val He Ser Val Leu Tyr Tyr Leu Met Ala Leu 
.-■5 230 235 240 

Arg Val Ser He Ala Gly Val Ala Gly 
24 5 

; 10 > n 

211 ■ 222 
:. 12 ■ DTI A 
213 H . Sapiens 

: 4 00 • 10 

it cci agatija tttttgctat cgtqcaaatt attggatttt ccaactccat ctgtaatccc 60 
it tq: ctatg catttatgaa tgaaaacttc aaaaaaaatg ttttgtotgc agtttgttat 120 



Ujca r . agtaa ataaaacctt ctctccagca caaaggcatg gaaattcagg aattacaatg 
ii qcigaaga aagcaaagtt ttccctcaga gagaatccag tg 

1 i: 20 

2:1 i ■ 3 
■:212 2HT 

: J 13 ■ H . Sapiens 

MOO - 2 0 

He Lys Met He Phe Ala He Val Gin He lie Gly Phe Ser Asn Ser 
L 5 10 15 

I _c -'vs Asn Fro He Val Tyr Ala Phe Met Asn Glu Asn Phe Lys Lys 
20 25 30 

Asn Val Leu Ser Ala Val Cys Tyr Cys He Val Asn Lys Thr Phe Ser 
J 5 4 0 4 5 

Pro A: a nn Arg His Gly Asn Ser Gly He Thr Met Met Arq Lys Lys 
-.0 55 60 

, I a Lys Phe Ser Leu Arg Glu Asn Pro 
>, 5 7 0 

<210 • 2 1 
3 1. • 4 4 7 
.' 1 2 ■ UNA 



180 



WO0I/J647J P< T/l SO../315S1 m ^ 

-'212'-- H. Sapiens 



< 000:- 21 

occacagcat gcagttttct gtagaattcc actttgtctt tgcacttgaa gaagatgagg 
t«-tct.ggtga ccaggatcac cacatagaat aggaaccgtg aggtacatgt ggatgtgcag 



cotgocactc 

g-jtcaagcga aatggcactg tcagcagaaa aacgctgtgg accaccacca agttaatgac 

c: -.icccjtggtg gtcactgacc gggtgttcat tttcaccagg aggaaaagaa tggaaatgac 

ci.-:ccaccaqc ccgccaataa gcactatgaa gtagaggctg attaagtggg gtgtcactat 

a igatcgcaa gaggaattcc tggaggtatt gtggccaggc atacttggga agtcacctgg 

aggagaaaaa gcaccagagt aactgac 

- HC • 22 

-211 ■ 1-19 

• 212> PRT 

*:213> H. Sapiens 

-MOO:- 22 

Val Ser Tyr Scr Gly Ala Phe Ser Pro Pro Gly Asp Phe Pro Ser Met 
I 5 10 15 

Fro Gly His Asn Thr Ser Arg Asn Ser Ser Cys Asp Pro He Val Thr 
20 25 30 

Fro His Leu He Ser Leu Tyr Phe He Val Leu Tie Gly Gly Leu Val 
35 40 45 

GLy Val Tie Ser He Leu Phe Leu Leu Val Lys Met Asn Thr Arg Ser 

50 55 60 

Val Thr Thr Met Ala Val He Asn Leu Val Val Val His Ser Val Phe 
65 70 75 80 

L-u Leu Thr Val Pro Phe Arg Leu Thr Tyr Leu lie Lys Lys Thr Trp 
85 90 95 

b'^t Phe Giy Leu Pro Phe Cys Lys Phe Val Ser Ala Met Leu His He 
100 105 110 

His Me- Tyr Leu Thr Val Pro He Leu Cys Gly Asp Pro Gly His Gin 
115 120 125 

He Pro His Leu Leu GLn Val Gin Arg Gin Ser Gly He Leu Gin Lys 

130 135 140 

Thr Ala Cys Cys Gly 
1 4 5 

• 2 10> 2 3 
■ 211> 222 

Page 15 



60 
120 



jeaaatttge agaagggcag cccaaacatc caagtcttct tgatgaggta 180 



2 4 0 

30 0 

3 60 
420 
447 
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J 1 J • 1: . Sapiens 



a 


4 00 

ctu 


n CC 


"J 

aag 


gtcagggcat cgactgaggc 


l.dgddggcca 


caggaaa t gc 


cagt caaggt 


6 0 


g 


1 td 


rcgcct 


gcaatcgcac ctaccacaaa 


ct t gaccggg 


ggcagggggg 


caggcccgcc 


120 






sac 


icg 


gtcagcagca ccagtccatt 


gcagaqcacg 


gagagcaa ca 


cgatggccca 


180 


c 


<iC(l 


.:cc 


-:gg 


cggatgcccc agctttcaaa 


gaggt actca 


ca 




22 2 




: ir 
J. 1 1 
:. i: 
i_ i > 


■;•} 

PRT 

H.Sapicr.s 












4 01' 




2 4 












1 


ys 


-,lu 


Tyr 


Leu Phe Glu Ser Trp 

5 


Gly lie Arg 
10 


Leu Ala Val 


Trp Ala 
Id' 






1 e 


■/ai 


Leu 


Leu Ser Val Leu Cys 

20 


Asn Gly Leu 
25 


Val Leu Leu 
3C 


Thr Val 




The 


Ma 


Gly 


Gly Pro Ala Pro Leu 
40 


Pro Pro Val 


Lys Phe Val 
45 


Val Gly 






1 a 


; le 


Ala 


Gly Ala Asn Thr Leu 
55 


Thr Gly lie 


Ser Cys Gly 
60 


Leu Leu 






] a 


2 e r 


Val 


Asp Al.a Leu Thr Leu 
70 


Val Ser 










.'10 
.: 1 ;. 

.:■ l 
2 13 




J 4 6 

:>!JA 

■ i . S a p i e n s 












; oo 




tctacacgct caccaaccgc 


gacc t gcgcc 


acqcgctcct 


gcgcctgg tc 


60 




get 


■ icg 


jdC 


gccactcctg cggeagagac 


ccgaq tggct 


cccagcagLc 


ggcgagcgcg 


120 


g 


ctg 




.-t.t 


ccggggg :ct gcgccgctgc 


ct gcccccgg 


gccttgatgg 


gagct t cage 


1 B0 




'JC- 


_:qg 


-gc 


gctcatcgcc ccagcgcgac 


gggctggaca 


ccagcggctc 


ca c a gg cage 


24 0 




•c .? 














24 6 




J 10 
. ' 1 i 
2 12 
21 i 




0 

:>RT 

fl . Sc 


p i e n s 












4 0 0 




: 6 
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Asn Pro He He Tyr Thr Leu Thr Asn Arg Asp Leu Arg His Ala Leu 
15 10 15 

Leu Arg Leu Val Cys Cys Gly Arg His Ser Cys Gly Arg Asp Pro Ser 

20 25 30 

Gly Ser Gin Glr. Ser Ala Ser Ala Ala Glu Ala Ser Gly Giy Leu Arg 

4 0 4 5 

Arg Cys Leu Pro Pro Gly Leu Asp Gly Ser Phe Ser Gly Ser Glu Arg 
50 55 60 

Ser Ser Pro Gin Arg Asp Gly Leu Asp Thr Ser Gly Ser Thr Gly Ser 
*",5 7 0 7 5 8 0 

Pro Gly 



: ".. n . ■ 2 ' 

mh 4.0 

:n> Dr. 'A 

:l'.<> H. Sapiens 



m i sc_f eat ure 
i 81 ) . . ( 10 6) 

ri is any nucleic acid 



■;gr.rj,iaqddC 


aqcgccacca 


tgaccagcat 


gtgcaccacg 


cgcgctctgc 


g 


:cgcgau gc 


60 


f cgcgggvrc 


qcagcctcct 


nnnnnnnnnn 


nnnnnnnnnn 


nnnnnntggc 


a 


gagcttgeg 


120 


cqcgat gcgg 


gcgtacatga 


ccacgatgag 


cgccagcggc 


gecaggtaga 


t 


gt gcgagaa 


180 


T.-igc.ica^r q 


gtgt agaccc 


tgcgcatgcc 


ctt ct eggge 


cagqect ccc 


a 


gcaggagta 


24 0 


:<-jq ( r:igq tag 


cagcggt t gc 


gggcgt ccac 


cat gaagtgg 


tqctcctcac 


g 


ggtgacggt 


300 


.-agcgt g:t-:g 


cccgagqgac 


acatgatgag 


cagcgccagg 


geccagatga 


c 


ggcgatggt 


360 


.Tdcu^gcgcc 


1 t ccgcaggg 


tea get t etc 


gcggaaaggg 


t geacgatge 


a 


geggaaect 


420 





1 J- 9 




PF.T 




H . S 








■■' o 




X ;. a 



apienr 



: 4 0 0 



I'he Arg Cys He Val His Pro Phe Arg Glu Lys Leu Thr Leu Arg Lys 
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1 5 10 15 

Ala Leu Val Thr lie Ala Val lie Trp Ala Leu Ala Leu Leu He Met 

20 2 5 3C 

Cvs Pro £er Ala Val Thr Leu Thr Val Thr Arg Glu Glu His His ?he 
35 4 0 ^5 

Ilet Val Psp Ala Arg Asn Arg Ser Tyr Pro Leu Tyr Ser Cys Trp Glu 

50 55 60 

Ala Trp Pro Glu Lvs Gly Met Arg Arg Val Tyr Thr Thr Val Leu Phe 
.;,5 J 70 75 30 

;:; fr His lie Tyr Leu Ala Pro Leu Ala Leu He Val Val Met Tyr Ala 
85 90 95 

Arg He Ala Arg Lvs Leu Cys Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa 
100 ' 105 113 

Xaa Glu Ala Ala Asp Pro Arg Ala Ser Arg Arg Arg Ala Arg Val Val 
115 12 0 12 5 

His Met Leu Val Met Val Ala Leu Phe Phe Thr 

130 135 

:/10> 2 9 

: 2 1 1 : 318 

:212-- DMA 

■:2 13:- H . Sap i ens 



■;4 00: 2b 

jcaqcqqgeg tqagtcctca ggcacttctt gaggtccttg -tgagcagga agcagacaat 



tggq*. tgacg gcagcctggg cgaagctcat ccaaacagca qtggccaggt agcggtgggg 
-ac;i<:cac<:ig gctttcacaa acactcgcca gtagcaggcc acgatgtagg gtgaccagag 
laqcoqaaag agcagtgtga tcgcgtagaa catgcggccc agcrgctttt cacccttgac 



318 



60 
120 
180 
24 0 

;t;cg'cca:g cccagtagcc gccggctggc tgcatgccca ttctgccqga tacccaqcag 300 
ggtrqqtgqc atgggccc 

;210 ■ 3 0 

;211 • 10 6 

:212 ■ Pi<T 

■'2 13- H Sapiens 

-100 • 3 0 

Gly Pro Met Pro Pro Thr Leu Leu Gly He Arg Gin Asn Gly His Ala 
1 5 10 15 

£\la Ser Arg Arg Leu Leu Gly Met Asp Glu Val I.ys Gly Glu Lys Gin 

20 21 30 

Leu Sly Arg Met Phe Tyr Ala He Thr Leu Leu Phe Leu Leu Leu Trp 
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Ser Pro Tyr lie Val Ala Cys Tyr Trp Arg Va I Phe Val Lys Ala Cys 
50 55 60 

Ala Val Pro His Arg Tyr Leu Ala Thr Ala Val Trp Met Ser Phe Ala 
t">5 7 0 7 5 8 0 

Gin Ala Ala Val Asn Pro lie Val Cys Phe Leu Leu Asn Lys Asp Leu 
8 5 90 95 

Lys Lys Cys Leu Arg Thr His Ala Pro Cys 
100 105 

■:.":10> 31 
■::ll> 3 54 
•:J12> Dt J A 
: . : 1 3 H. Sapiens 

^00> 31 



-..itt ctgt a a 


t gaagaat gt 


cat tcacact 


gecat tggca 


cat ccagtgg 


cct cacct ag 


60 


.".it t qtgaaa 


gccct t egg t 


tggtgtat tg 


ceact t cat t 


tt aaaaggat 


gcacaagt cc 


120 


otgqtgcct t 


t ccacagcaa 


t gcaggt cat 


agtgaggatt 


t ctgt cacaa 


cagegg taga 


180 


•jr. ggacaaat 


ggcaccatct 


t gcaaatgaa 


accacct gca 


gt aaggaaat 


agga t aaa t c 


24 0 


a tacatcaaa 


acaaaaagaa 


taaaggtttc 


atctgtgtct 


tt gtaattat 


cactat cagt 


3 0 0 


ccat t ct gag 


octet gecaa 


aaagt t tgat 


aattgtaatt 


act ctgtaga 


caca 


351 



■:il0.- 32 
. 11 - 117 

■:::i2> PRT 

■:21 3: ■ H. Sapiens 
*:<i00:- 32 

Val Tyr Arg Val lie Thr lie lie Lys Leu Phe Cly Arg Gly Ser Glu 
15 10 15 

Trp Thr Asp Ser Asp Asn Tyr Lys Asp Thr Asp Glu Thr Phe lie Leu 

2 0 2 5 30 

Phe Val Leu Met Tyr Asp Leu Ser Tyr Phe Leu Thr Ala Gly Ala Phe 
3 5 4 C 4 5 

lie Cys Lys Met Val Pre Phe Val Gin Ser Thr Ala Val Val Thr Glu 
50 55 60 

lie Leu Thr Met Thr Cys lie Ala Val Glu Arg His Gin Gly Leu Val 

65 70 7 5 80 

His Pro Phe Lys Met Lys Trp Gin Tyr Thr Asn Arg Arg Ala Phe Thr 

85 90 ^5 

Met Leu Gly Glu Ala Thr Gly Cys Ala Asn Gly Ser Val Asn Asp lie 
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Leu His Tyr Arg lie 
115 

<210 ■ 3 3 

■ 621 

■:212 ■ DUA 

■ : 2 1 3 ■ H.Sapiens 



<:4 00 ■ 3 3 

fjagc. : iacatg atctttttga agtacttgac ggtgtcgttc ttgaoggtca cgaagcacag 
dqtq'tgatc atgctgttgc tcatggcgat gcactcgacg atgtagaagg cagtgaggta 
ijigcttcLL-c ttcacaaaca cggtggggaa gaagtcgcgc acganggtga agccgtagaa 



iigc'icccag catagcacg: aggcggr.gag gatgcacatg agcaocagga _-cqtctt<: 



(.^■gq-dgcqc agcctcttgc ggatctgctc tgtctggaat ccagggaccg :cttgaacca 

<} (J gc* cccgg gagatcctgg catagcacag ggtcatggtg acca :ggggc -:cdcgaa:tc 

:,jttpcaaag ataaagagga agtaggarLt gtagtagagc tgctggtcca oaggccagat 

c^ggjcgcag aagatctttt ccrtggctctt gacaatgacg agga :cgtct -ggtggtgaa 

ij*aggcggaa gggatggcga tcaggutgga caccgtcca; accaiggcaa tcaggccagt 

g gctgt ttgg cacttcattc gtggtctcag cggatggaca atagocagat acctagggca 

a :aacacaag tggaggcagc c 

10 ■ 3 4 

'LIT ■ 207 

<.'I2 • PRT 

:2 13 ■ H . Sapiens 

•;-;00 • 34 

■ ;iv Gys Leu His Leu Cys Ser Cys Pro Arg Tyr Leu Ala lie Val His 
L 5 10 15 

Pt-o Leu Arg Pro Arg Met Lys Cys Gin Thr Ala Thr ^ly Leu lie Ala 
20 25 30 

L'iu Val Trp Thr Val Ser lie Leu lie Ala lie Pro 3cr Ala Tyr Phe 
3 5 4 0 4 5 

Ttir Thr Glu Thr Val Leu Val Ilo Val Lys Ser Gin Glu Lys lie Phe 

50 55 60 

7-/S Gly Gin Tie Trp Pro Val Asp Gin Gin Leu Tyr Tyr Lys Ser Tyr 
65 70 75 B0 

Phe Leu Phe lie Phe Gly lie Glu Phe Val Gly Pro Val Val Thr Met 

8 5 90 95 

Page 2 0 



60 
120 
180 



:t 24 0 



3 00 

3 60 

4 20 

4 13 0 

5 4 0 
600 
62 1 



RNsroan -wo o met 
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Thr Leu Cys Tyr Ala Arg Ilo Ser Arg Glu Leu Trp Phe Lys Ala Val 
100 " 105 110 

Pro Gly Phe Gin Thr Glu Gin lie Arg Lys Arg Leu Arg Cys Arg Arg 
115 12 0 125 

Lys Thr Val Leu Val Leu Met Cys He Leu Thr Ala Tyr Val Leu Cys 

130 135 14C 

Trp Ala Pro Phe Tyr Gly Phe Thr He Val Arg Asp Phe Phe Pro Thr 
145 150 155 160 

Val Phe Val Lys Glu Lys His Tyr Leu Thr Ala Phe Tyr He Val Glu 
165 170 175 

Cys He Ala Met Ser Asn Ser Met lie Asn Thr Leu Cys Phe Val Thr 
180 185 190 

Val Lys Asn Asp Thr Val Lys Tyr Phe Lys Lys lie Met Leu Leu 
195 200 205 

•210- 3 5 

■211 ■ 4 83 

■2 12:- DI-JA 

■ :H 3: H . Sapiens 

■ mO0 :■ 35 



c.igc 


:acact 


gcagtgatga 


aatcaaatgt 


ccaacaccaa 


ccatagtcac 


cattactaac 


6 0 


t aag 


-j a g c c a 


caaaacttcc 


c-tccagqgt 


gt t cagcagc 


agggacaggg 


cccagggcag 


120 




-a cat g 


acagttgaca 


ggtt tcttgg 


gcagcagcag 


cagt a c caga 


taqgccgcag 


1 130 


u^ca 


j a c a q g 


cagcactcag 


t actga t ggc 


actcagcatg 


ctcaggccta 


ca a ggt a ggc 


2 4 0 


t i < 3 a g 


;t cat c 


acgctqgtqa 


agaagct agg 


gaaat tgat g 


gagatggaac 


agaagaagt t 


30 0 


actg 


j g g t a c 


accaggcaa t 


ttataatctg 


gaagcagagg 


aaqaggaagt 


cggc cccgg c 


3 6 0 


cagg 


1 gacg 


acqt agacag 


agaaggcgt t 


cctgcgcatg 


cggaagccca 


ggagccagag 


'3 2 0 


; -.a ca 


idcccg 


1 1 t cct acca 


gcccgaccag 


ggcaatgaaa 


agga t ca gga 


agaccgggat 


■3 8 0 


cag 














4 83 



■210;- 3 C 

:. il: 1M 

■ ; i 2: PET 

■ _ : 3 : • H. Sapiens 

■ -:oo> 3 6 

Lou He Pro Val Phe Leu He Leu Phe lie Ala Leu Val Gly Leu Val 
5 10 15 

Gly Asn Gly Phe Val Leu Trp Leu Leu Gly Phe Arg Met Arg Arg Asn 

2 0 2 5 30 
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Ala Phe Ser Va 1 Tyr Val Leu Ser Leu Ala Gly Ala Asp Phc Leu Phe 
3 5 4 0 4 5 

Leu Cys Phe Gin He lit Asn Cys Leu Val Tyr Leu Ser Asp. Phe Phe 

-0 55 60 

3ys ;;er He Ser He Asn Phe Pro Ser Phe Phe Thr Ser Val Met Thr 

65 7 0 7 5 8 0 

Phe Ala Tyr Leu Val Gly Leu Ser Met Leu Ser Ala lie Ser Thr Glu 

8 5 90 95 

Cys Cys Leu Ser Val Leu Arg Pro He Trp Tyr Cys Cys Cys Cys Pro 
100 105 110 

Arg Asn Leu Ser Thr Val Met Cvs Ala Leu Pro Trp Ala Leu Ser Leu 
115 i:!0 125 

Leu Leu Asn Thr Leu Glu Gly Lys Phe Cys Gly Phe Leu Val Ser Asn 
130 135 1-10 

Gly Asp Tyr Gly Trp Cys Trp Thr Phe Asp Phe He Thr Ala Val Trp 
.45 150 155 160 



H 0 : ■ 


37 


:: 1 1 ; ■ 


33C 


212:- 


DNA 


; 1 1 3 . ■ 


H . S 


4 0 0- 


37 



-j<iga<: tctga 


1 1 ctgactta 


cat ca ca tat 


gt aggcctgg 


geat'tctat 


ttgcagcctg 


60 


r.cct ttgct 


tgtccgttga 


ggt cctagtc 


t ggagccaag 


t g a c a a agac 


agagatcacc 


1 2 0 


\ c i l t * acgcc 


atgtgtgcat 


tgtt aacatt 


gcagccact t 


tgctgatggc 


a g a t g t g t g g 


180 


*. *. c a t t g t g g 


cttcct Ltct 


tagt ggceca 


at aacacacc 


acaagggatg 


t gtggcagcc 


24 0 


jcatTttttg 


gtcatt tctt 


tt acctttct 


gtatt tttct 


ggatgcttgc 


caaggcactc 


30 0 


::tatcctct 


a tgqaat ca t 


gatt gttttc 








330 



..10 • 


3 8 


G ■ 1 1 : ■ 


1 10 


;. 1 2 • 


PRT 


: 2 13:- 


H . S 


; 4 0 0 : • 


3 3 



Glu Ger Leu He Leu Thr Tyr ILc Thr Tyr Val Gly Leu Gly He Ser 
1 5 10 15 

He Cvs Ser Leu He Leu Cys Leu Ser Val Glu Val Leu Val Trp Ser 

2 0 2 5 3 3 
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Gin Val Thr Lys Thr Glu 11c Thr Tyr Leu Arg His Val Cys lie Val 
3 5 4 0 4 5 

Asn lie Ala Ala Thr Leu Leu Met Ala Asp Val Trp Phe lie Va^ Ala 
50 55 60 

Ser Phe Leu Ser Gly Pro He Thr His His Lys Gly Cys Val Ala Ala 
6b 70 75 80 

Thr Phe Phe Gly His Phe Phe Tyr Leu Ser Val Phe Phe Trp Met Leu 

85 90 95 

Ala Lys Ala Leu Leu lie Leu Tyr Gly He Met lie Val Phe 

100 105 110 

•:210> 3 9 

•:211> 6;:8 

■ : . ' 1 2 > DN A 

•'21 3:- H. Sapiens 

■MOO:- 39 



r.tgtgtggca 


gtagagagat 


gt caggct t c 


a gagtcaaca 


agaactggat 


tt caaactgg 


t>0 


at 1 t gaggac 


ccccacct 1 1 


ggt aagtgac 


t tattatctg 


cgagcctct g 


tttctctct t 


i;m.i 


ex. tt: naacga 


ggacagtaaa 


tcccat aegg 


cagggtggtg 


gggagaa t ca 


gagatqata:: 


1 '■■>> < 1 


,i qctg ^tgat 


cacatctggt 


ttgtgt tec z: 


aggggcacca 


gactagggt t 


t.ctgagcatg 


2-; i 


("i atccaaccg 


t cccagt ct t 


eggtacaaaa 


ctgacaccaa 


t :aacggacg 


t gaggagact 


3 o 1 1 


< I'jttcctaca 


atcagaccct 


gagct t cacg 


gt getgaegt 


gcatcatttc 


cct z qt egg. 3 


3»)n 


'-.tgacaggaa 


aegeggt agt 


gctctggctc 


ct gggctacc 


gca t gcgcag 


gaacget gt c 


4 :!(.« 


t i;cat chica 


t cct eaacct 


ggccgcagca 


ga cttcctct 


t cct eager t 


ccagat t at a 


4 M(. 


< ■uttcgecat 


tacgect cat 


caa tat cage 


catcrcatcc 


gcaaaa t cct 


eg 1 1 1 ctgtg 


5 4 i > 


,.-it gacctttc 


cctact 1 1 ac 


aggect gagt 


a t getgageg 


cca t cagcac 


cgaqcgctgc 


e 1 1 o 


crgtctgttc 


t gt ggeccat 


ctggtacc 








620 



• 2 10:- 40 

• 2 11 :- 2 0 5 

• 212.- PPT 

• L 1 3 : H . S a p i e n s 

■4 00:- 4 0 

Leu Cys Gly Ser Arg Glu Met Ser Gly Phe Arg Val Asn Lys Asn Trp 
1 5 10 15 

He Ser Asn Trp lie GJ y Pro Pre Pre Leu Val Ser Asp Leu Leu Ser 
2 0 2 5 3 0 

Ala Ser Leu Cys Phe Ser Leu Leu Met Arg Thr Val Asn Pro He Arg 
35 40 45 
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Gin 


Gly Gly 

5 0 


Gly 


Glu 


Asn 


Gin 
55 


Arg 


Tyr 


Scr Trp 


Ser 
60 


H i s 


Leu 


Vai 


Cys 


Vai 
6 7. 


Pro Arg 


Giy 


Thr 


Arg 

7 0 


Leu 


Gly 


Phe 


Leu Ser 

75 


Met 


Asp 


Pro 


Thr 


Vai 
80 


Pro 


Vai Pho 


Gly 


Thr 

85 


Lys 


Leu 


Thr 


Pro 


lie Asn 
90 


Gly 


Arg 


Glu 


Glu 
9 5 


Thr 


Pro 


-ys Tyr 


Asn 

100 


Gin 


Thr 


Leu 


Ser 


Phe 
105 


Thr Vai 


Leu 


Thr 


Cys 
110 


He 


He 


Ser 


Leu Vai 
115 


Gly 


Leu 


Thr 


Gly 


Asn 
120 


Ala 


Vai Vai 


Leu 


Trp 
12 5 


Leu 


Leu 


Gly 


Tyr 


Arg Met 
1 3 0 


Arg 


Arg 


Asn 


Ala 
135 


Vai 


Ser 


lie Tyr 


He 
14 C 


Leu 


Asn 


Leu 


A.l a 


A.l a 
1 -15 


Ma Asp 


Phe 


Leu 


Phe 
150 


L eu 


Ser 


Phe 


Gin lie 

155 


He 


Arg 


Ser 


Pro 


Leu 
160 


Arg 


Leu lie 


Asn 


lie 
165 


Ser 


His 


Leu 


lie 


Arg Lys 
170 


He 


Leu 


Vai 


Ser 
175 


Vai 


:•]<- 1 


i hi Phe 


Pro 
1H0 


Tyr 


Phe 


Thr 


Gly 


Leu 
185 


Ser Met 


Leu 


Ser 


Ala 

1 90 


He 


Ser 


Thr 


.'■Lu Arg 
195 


Cys 


Leu 


Ser 


Vai 


Leu 
200 


Trp 


Pro lie 


Trp 


Tyr 

2 05 








•::.-l0 
•:. 1 1 

•:: 12 

. ■ ■ 1 -3, 


■ 4 :. 

■ 3 : ° 

DMA 

H.Sapien 
























■MOO 
a aq 


4 I 
- a a g c a 


agqc 


caccag gaccttaggc 


atagtcatgg 


g a g t g 1 1 


tgt 


gttgtgctgg 


or go 


.-cttct 


ttgtcttgac gatcacagat 


cct 1 1 cat t a 


atttta e 


aac 


cct t gaagat 


Ct.gt 




tctt 


cctctg gctaggctat 


ttcaactctg 


ctttcaatcc 


:att 


ttatat 


ggcs 


vgcttt 


atcc 


ttggtt. t 


;qcaaggca 


t t gaggatga 


t tgtcac 


agg 


-atqatct Lc 


' :..] cc 


-t gact 


cttc 




t aagcctgttt 


t ctgcccatg 


cttagg c 


tgt 


gttc 


atcatt 


< : a a t 


■: ggact 


ctt ttctgg 




















■:. 10 
1 

■:. 12 
•.. 13 


4 2 

i o : 

PRT 

H . Sapiens 






















<4 00 


42 


























Thr 
i 


jlu Sor 


Lys 


Ala 


Thr 


Arg 


Thr 


Leu 


Gly He 

10 

Page 


Vai 
2 4 


Met 


Gly 


Vai 
1 5 


Phe 



60 
120 
180 
24 0 
300 
319 



RNSUX:;:D <WO 



WO (» 1/36473 PCT/US00/31581 



Val Leu Cys Trp Leu Pro Phe Phe Val Leu Thr He Thr Asp Pro Phe 

20 25 30 

lie Asn Phe Thr Thr Leu Glu Asp Leu Tyr Asn Val Phe Leu Trp Leu 
35 4(f 45 

Gly Tyr Phe Asn Ser Ala Phe Asn Pro lie Leu Tyr Gly Met Leu Tyr 

5 0 5 5 60 

Pro Trp Phe Arg Lys Ala Leu Arg Met He Val Thr Gly Met He Phe 
o5 70 75 80 

His Pro Asd Ser Ser Thr Leu Ser Leu Phe Ser Ala His Ala Ala Val 

85 90 95 

Phe He He Gin Asp Ser Phe 

100 

-:210> 43 

•:211:- 515 

■ : :": 1 2 : - dna 

■ 1 3 ■ H . Sapiens 



•:400:- 4 3 



t. d g g d a t :tc 


agaga agaaa 


gtaaggaacc 


agaaa accat 


aaaagaat gt 


aaatggaaaa 


60 


gaa t cage a a 


atcttat tea 


ct t at cacta 


aatctaaaat 


atgt caaaa t 


acat gaagac 


120 


aci :a a a t gc t 


t tagaacaac 


tgtt gaatgt 


at tgt cctac 


aacttggcat 


a tgatca t gc 


18 0 


1 1- gcctctct 


atgtccaagt 


gt t tattttt 


gcagt tgacc 


t t aatttcaa 


gt tagt 1 1 1 g 


24 0 


a ag t ct c t a c 


agtaa tgt tt 


1 1 aa t ct gtc 


tctacttctt 


cagaaaat aa 


at tagttgt t 


30 0 


gaogcat cag 


tcct t aagac 


ct t gecgett 


acaataagtt 


t t attgeett 


cccaaacuat 


3 60 


i" . c:g t a a a.aga 


aagca t a aat 


caaggggt t c 


a t agct gaat 


tataataaac 


acaccaaact 


4 2 0 


,."iadat ctcat 


aaaca t aagg 


aggagt tat a 


aaat t catat 


aagca tcaat 


cac tgea t ca 


4 80 


ucgaogt a t g 


gt ageca aga 


ga caagaaat 


gctgc 






515 



■.::lO: 4 4 

•Oil:- 143 

•:i2> PRT 

H L 3: H . Sapiens 

v A 00: 4 4 

Leu His Gin Arg Gly Met Val Ala Lys Arg Gin Glu Mot Leu Ala Ala 
15 10 15 

Phe Leu Val Ser Trp Leu Pro Tyr Leu Val Asp Ala Val He Asp Ala 
20 25 30 

Tyr Met Asn Fhe He Thr Pro Pro Tyr Val Tyr Glu He Leu Val Trp 
35 40 45 
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Cys Val Tvr Tvr Asn Ser Ala Met Asn Pro Lea 

50 ' ' 55 

T /r Gin Trc Phc Gly Lys Ala lie Lys Leu lie 

6 5 ' 7 0 7 5 

Leu Arq Thr Asp Ser Ser Thr Thr Asn Leu Phc 

85 90 



Thr Asp Lys H^s 
100 



Tyr Cys Arg Asp Leu Lys Thr 
105 



Thr A] a Lys lie Asn Thr Trp Thr Arg Gly 



1 ]. 5 



120 



Pro Ser Cys Arq Thr He His Ser Thr Val Val 
130 * 135 

S*-r Ser Cys Tie 



lie Tyr Ala Fho Phe 

60 

Val Ser G.l y Lys Va 1 

8 0 

Ser Giu Glu Val Glu 

9 5 

Asn Leu Lys Leu Arg 
1 1 0 

Lys His Asp His Met 
125 

Leu Lys Hrs Leu Leu 
1 4 0 



• : : : o • 45 

• . I 1 72 6 

■:: 12 :■ DNA 
: _ i 3 : ■ H . S a p i e n s 

■ : 4 0 0 : • 4 5 

ij r gg L iaagag gtcctcgatc tatccLctdC gccgtccttg 

g -:gtTtggaa ac.tactqg: catgattgct atccttcact 

,i- -.jat:ct t tc tgattgegre gctggcctqt gctgacttct 

- ttcagca cagtgagqt z tgtggagagc tgttggtaot 
t :c,.ta:^t gttttgaca: atctttctgt tttgettett 

- r.gt tg at a gatacattg: tgttactgat octet gacct 
■;-ig-:tt :ag ggatatgeae tgttctttcc tggttctttt 

i *■ ct * t taoa egggagecaa cgaagaagga attgaggaat 

■j - aggagg:t gccaggctc: actgaatcaa aactgggtcc 

L :a?.acc za atgtcgccat ggtgtttata tacagtaaga 

^.-.ggjtagga agatagaaag tacaqccagc caagctcagt 

j iaagagv.ag caaaaagaga gagaaaggct gr.caaaacct 

• r t c: t 



gt 1 1 tggggc 
totaacaact 
tqgtgggagt 
t iggggacag 
tatttcatt t 
a t ccaaccaa 
ct g tea cat a 
tagtagttgc 
tactttgttt 
t at ttttggt 
cct t ctcaga 
t g g g a a 1 1 g c 



tgtgctggca 
qcacacacct 
cactgtgatg 
ttactgtaaa 
at get g t ate 
gtttactgtg 
eager 1 1 teg 
tct aa ectgt 
tctt c t a 1. 1 c 
ggeca a g e a t 
gagt t. a eaag 
t a t g g e a g c a 



6 0 
1 2 0 

1 8 0 

2 1 0 

3 0 0 

3 6 I) 

a 2 :> 

4 8 J 

5 A 1 

6 0 0 
6 60 
1 2 0 
72 6 



: 1 0 - 4 6 

■'11- 2 4 i 

112 > PRT 

ll'i> H 



a p i e r 
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<.U)C> 4 6 

Leu Glu Arg Gly Pro Arg Ser lie Leu Tyr Ala Val Leu Gly Phe Gly 
15 10 15 

Ala Val Leu Ala Ala Phe Gly Asp. Leu Leu Val Met He Ala He Leu 
20 ' 25 30 

His Phe Gin Leu Has Thr Pro Thr Asn Phe Leu lie Ala Ser Leu Ala 

:j 5 4 0 4 5 

Cys Ala Asp Phe Leu Val Gly Val Thr Val Met Pro Phe Ser Thr Val 
5 0 5 5 60 

Arg Ser Val Glu Ser Cys Trp Tyr Phe Gly Asp Ser Tyr Cys Lys Phe 
65' 7 0 7 5 8 0 

His Thr Cys Phe Asp Thr Ser Phe Cys Phe Ala Ser Leu Phe His Leu 

85 90 95 

Cys Cys lie Ser Val Asp Arg Tyr He Ala Val Thr Asp Pro Leu Thr 
100 J 105 110 

Tyr Pro Thr Lys Phe Thr Val Ser Val Ser Gly He Cys lie Val l,eu 
i:5 120 12 5 

Ser Trp Phe Phe Ser Val Thr Tyr Ser Phe Ser He Phe Tyr Thr Gly 
130 135 140 

Ala Asn Glu Glu Gly He Glu Glu Leu Val Val Ala Leu Thr Cys Val 
14 5 150 155 160 

Gly Gly Cys Gin Ala Pro Leu Asn Gin Asn Trp Val Leu Leu Cys Phe 
165 170 175 

Leu Leu Phe Phe He Pro Asn Val Ala Met Val Phe He Tyr Ser Lys 
180 185 190 

Lie Phe Leu Val Ala Lys His Gin Ala Arg Lys He Glu Ser Thr Ala 

1^5 200 205 

Ser (Jin Ala Gin Ser Phe Ser Glu Ser Tyr Lys Glu Arg Val Ala Lys 
110 215 22 0 

Arc Glu Arg Lys Ala Ala Lys Thr Leu Gly He Ala Met Ala Ala Phe 
2 2 <i ' 23 0 235 240 



■CIO:- 

• HI 660 

<::12> DMA 

<2 1 3> H . Sapi ens 

. 4 0 0> A 7 

aaccaggtgg ccttactcct aagacccctg gccttgtcta tggcctttat caacagctgt 60 
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ctcaatccag ttctctatgt cttcattggg catgacttct gggagcactt gctccactcc 120 

ctgctagctg ccttagaacg ggcacttagc gaggagccag atagtgcctg aatcccagct 180 

cccaggcuga tgagtecttt ataacatgac ccaatitcct ac:cca:ttt cccacractc 24 0 

aatcctcttc craaacagct ctaccataat ccaacatcca acagaattta agagaataaa 300 

ccacaact:*: taagtgagct ctatgtgc:a ggtcatgttt tagaat.ac.aa cctraagcgc 360 

oi:qgaaga:g gaggcaagaa acaaacaagg tctcartctr tagaggaaga cagttcacca 420 

agactcaaac agaaaaaaag atagttatct tgtgacaaaa caagtcat.aa aattgqgtca 480 

gijaoctgcag caatgacttt atgctagaat ccagagcact agcaggaaac tgcttaaatt 54 0 

*.r:acttaatc aaagtcaagt ttggacatac atgtcaggta aaacctagca gagatgagct. 600 

aoc^ taatr.t. taaaacttca agggaLagct caatgtcatc aagatccttt tgatgacttg 660 

j i : • as 

211 • 211 

■:.!12e PRT 

3:- H. Sapiens 



Asn Gin Val Aia Leu Leu Leu Arg Pro Lou Ala Leu Sor Met Ala Phe 

1 5 10 15 

: Le Asn Ser Cys Leu Asn Pre Val Leu Tyr Val Phe lie Gly His Asp 

2 0 2 5 30 

Phe Tip Glu His Leu Leu His Ser Leu Leu Ala Ala Leu jlu Arg Ala 
3 5 4 0 4 5 

Leu Sec Gla Glu Pro Asp Ser Ala lie Pro Aia Pro Arg Gin Met Ser 
50 5 5 60 

Pro Leu His As d Pro He Ser Tyr Ser He Phe Pro Pro Leu Asn Pro 
o'j ' 70 7 5 3 3 

Leu Pro Lys Gin Leu Tyr His Asn Pro Thr Ser Asn Arg He Glu Asn 

8 5 90 9 5 

L/s Pro 3.1 n Leu Leu Ser Glu Leu Tyr Val Leu Gly His Val Leu Glu 
100 10 5 110 

"C rr Apt. Leu Lys Cys Leu Glu Asd Gly Gly Lys Lys Gin Thr Arg Set 
115 120 125 

His Ser Leu 31 u Glu Asp Ser Ser Pro Arg Leu Lys Gin Lys Lys Arg 
13 0 1 3 5 14 0 

Leu Ser Gys Asp Lys Thr Ser His Lys lie Gly Ser Gly Pro Ala Aia 
14 5 150 155 160 

Met Thr Leu Cys Asr: Pro Giu His Gin Glu Thr Ala He Leu Leu Asn 
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165 



US 



Gin Ser Gin Val Trp Thr Tyr Met Ser Gly Lys 
180 ' ' 185 



Thr Gin Arg Al^ 
190 



Lei 



lie Leu Lys Leu Gin Gly lie Ala Gin Gys 
195 200 



His Gin Asp Pro Phe 
205 



Aop Asp Leu 

;_ io 



.J 10 

jii 

1 1 2 
113 



4 9 
4 65 
DNA 

H . Sapiens 



: 4 0 0 



4 9 



get t'lttcac 


ggccacca t c 


ct caagctgt 


t gcgcacgga 


ggagg cgcac 


ggcegggage 


6 0 


ugcggaggcg 


cgcggtggg : 


ct ggccgcgg 


tggtcttget 


ggcctttgtc 


acctgcttcg 


120 


c :cccaac«3a 


cttcgtgct-: 


ct ggcgcaca 


t eg t gagecg 


cctgtt ct ac 


ggcaagagct 


180 


a :taccacqt 


gtacaagct-: 


acgctgtgtc 


t cagctgcct 


caacaact g t 


ctggacccgt 


24 0 


l g 1 1. 1 a 1 1 a 


ct t tgcgtcc 


cgggaat tec 


agct gegee t 


gegggaata t 


ttgggctgcc 


300 


g ccqugtg cc 


caqaga ca co 


ctggacacgc 


gecgegagag 


cctcttctco 


gccaqqacca 


360 


ojitcrgtgeg 


ctccgaggcc 


ggtgcgcacc 


ct gaagggat 


ggagggagee 


accaggcccg 


420 


gcct. ccagag 


gcaggagag" 


gtgt tctgag 


tcccgggogc 


gcagc 




4b5 



50 

1*50 

PRT 

H . Sapiens 



5 0 



Lt-u Phe Thr Ala Thr lie Leu Lys Leu Leu Arg Thr Glu Glu Ala His 



10 



15 



Uy Arg Clu Gin Arg Arg Arg Ala Val Gly Leu Ala Ala Val Val Leu 

20 " 25 30 



Leu Ala Phe Val Thr Gys Phe Ala Pro Asn Asn Phe Val Leu Leu Ala 



3 5 



40 



4 5 



Hi 5 lie Val Ser Arg Leu Phe Tyr Gly Lys Ser 

50 55 



Tyr Tyr His Val Tyr 

60 



Lys Leu Thr Leu Cys Leu Ser Cys Leu Asn Asn Cys Leu Asp Pro Phe 
7 0 7 5 8 0 

'/a I Tyr Tyr Phe Ala Ser Arg Glu Phe Gin Leu Arg Leu Arg Glu Tyr 
8 5 90 9 5 
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Leu Gly Cys Arg Arg Val Pro Arg Asp Thr Leu Asp Thr Arg Arg Glu 
100 105 110 

Ser Leu Fr.c Ser Ali Aig Thr Thr Ser Val Arg Ser Glu Ala Gly Ala 
] 1 5 12 0 12 5 

His Pro Glu Gly Met Glu Gly Ala Thr Arq Pro Gly Leu Gin Arg Gin 
130 135 1^0 

Glu Ser Val Phe Va L Pro Gly Ala Gin Ala Ala Pro Pro Gly Leu Arg 
14 5 15 0 15 5 160 

2 1 0 > 51 

211> 603 

21. :: DNA 

2 1 H . Sapi ens 

A ■)■' 51 



v . tar: 


. tatt c 


tgecett tat 


cc-aa 


c 1 1 1 1 a 


attccctttg 


ctattctc :: t 


qectea 


t 1 1 1 


60 


-tg<j 


. ctcat 


tttccctatt 


at;c 


t goctc 


acattgatca 


agggatgagg 


Ct ggc ^ 


ggat 


120 


ocgu 


iaccca 


caqqqccccg 


tggg 


ccatga 


gaggct cct g 


qacttgaa :c 


tcagg a 


:act 


1H0 


:c-:,i 


.■ t ctcg 


ctqeegg rag 


ggat 


ggaagc 


t g g a t g a g c a 


ggcaggaqct 


ggoagt 


*g : 39 


2 ■? 0 


jt: go 


.= gugcc 


a t a q g c t a 1 1 


gggg 


tggaca 


ggcttgggt g 


cctcatggga 




: a t g 


300 


gg ri'.; 


t g t g g 


ceccttgggq 


cct s 


tta t tt 


ctcacccccg 


gctttcccgg 


gaga g g 


ttca 


3 60 


■Kit : : 


s g a a a a 


tgccccas a g 


at:: 


acgtgg 


ccctgggtgg 


caqcctgrtc 


c:cc: g 


■late 


4 2 0 


•:q -: 


■ tret t 


qgtcaafgtq 


qggagtgg zt 


ca aaqgggt c 


t gat get gec 


tget gg 


jeec 


4 H 0 


•JO 30 


::gcrct 


cttccaccac 


ttcc 


t got ct 


gegee 1 1 cac 


c t ggat gggc 


ct tga agect 


5 30 


KCZ ."J 


-otct a 


cct get c get 


gtc a 


gggtct 


tcaacaccta 


ct tegggcac 


tacttc 


-tga 


6 30 


J q C 


















60 3 


:21 J 
■:21 ] 
:2lJ 
:21 < 


5 2 
198 
PF.T 
> H.Sc 


j p i e n s 

















: '1 0 0 > 5 z 

11 u Thr Tyr Ser Ala Leu Tyr Pro Thr: Phe Asm Ser Leu Cys Tyr Ser 

15 10 15 

Pre Ala Ser Phe Ser Gly Leu Tie Phe D ro lie He Leu Pro His lie 

20 25 30 

Asp Gin Gly Met Arg Leu Ala Gly Ser Gly Thr His Arg Ala Pro Trp 

3 5 4 0 4 5 

Ala Met Ara Gly Ser Trp Thr Thr Ser Gly His Ser His Ser Gly Cys 

50 " 5 5 60 
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Arg Gin Gly Trp Lys Leu Asp Glu jln Ala Gly Ala Gly Ser Gly Gly 

bS 7 0 7 5 BC 

Gly Glu Pre A La He Gly Val Asp Arg Leu Gly Cys Leu Met Gly Ala 
8 5 90 95 

Pro His Glv Ser Cys Gly Pro Leu Gly Pro Leu He Ser His Pro Arg 
" 100 105 110 

Leu Ser Arg Glu Arg Phe Lys Ser Glu Asp Ala Pro Lvs He His Val 
115 120 125 

Ala Leu Gly Gly Ser Leu Phe Leu Leu Asn Leu Ala Phe Leu Val Asn 
1 3 0 13 5 14 0 

Val lily Ser Gly Ser Lys Gly Ser Asp Ala Ala Cys Trp Ala Arg Gly 
14 5 150 155 160 

Ala Val Phe His Tyr Phe Leu Leu Cys Ala Phe Thr Trp Met Gly Leu 

1 6 5 17 0 17 5 

Glu Ala Phe His Leu Tyr Leu Leu Ala Val Arg Val Phe Asn Thr Tyr 
180 18 5 190 

Phe Gly Hi? Tyr Phe Leu 
195 

•'HO • 53 

- .• : n 335 

• : 2 1 2 : • DNA 

■ : 2 1 3 : • H . Sapiens 

■ : 0 0 : • 5 3 



a a 1 1. ggtegg 


agaqtgeage 


tgcttgaaat 


ggaggattga 


aatcatcacc 


aggaggt 1 1 c 


60 


aa ca cage 


cagcacagcc 


ccaaagccaa 


acact atgta 


cagaat cacc 


cgggatcccg 


120 


q<_:gar?aaggg 


gattttcaca 


caggacccat 


tcacgttcgc 


gtagcacagc 


tgcacagcca 


180 


<-.< :aqcaggga 


t g a a 1 1 g c t g 


ctcataacgc 


tggtatttac 


a ta t ggagaa 


at t ttgtcct 


240 


^.qttcattat 


cacaaaaaat 


acaggattgt 


rcctgatitt 


cattgctcct 


gecgaaaaaa 


300 


a c a c a t a 1 1 c 


accaggatgc 


cagaggaaa t 


gatca 






335 



'.'10:- 5.1 

: . ' 1 1 : ■ 111 

J 1 2 • PRT 

. 2 1 3 : • H. Sapiens 

:400> 54 

Asp His Phe Leu Trp His Pro Gly Glu Tyr Val Phe Phe Ser Ala Gly 
1 5 10 15 

Ala Met Lys lie. Arg Asn Asn Pro Val Phe Phe Val Tie He Asn Lys 
2 0 ' 2 5 3 0 
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Asp Lys He Ser Pro Tyr Val Asn Thr Ser Val Met Ser Ser Asn Ser 
35 4 0 4 5 

Ser Leu Leu Va! Ala Val Gin Leu Cys Tyr Ala Asn Va I Asn Gly Ser 
50 55 tSO 

'"""s Val Lys He Pro Phe Ser Pro Gly Ser Arg Val Tie Leu Tyr lie 
•5S 7 0 7 5 8 0 

Val Phe Gly Phe Gly Ala Val Leu Ala Val Phc Gly Asn Leu Leu Val 
85 90 95 

Met He Ser lie Leu His Phe Lys Gin Leu His Ser Pro Thr Asn 
100 105 110 

■:210> 55 
;L11: 586 
12 ■ DMA 
■-'LI 3: ■ H . Sapiens 

;00: 





cttaa 


c:aagactqaa 


aaaca 1 1 gat 


ttgtttttaa 


tttgaagagc 


aatttatttg 


hO 


:f.at* 


cattc 


atagtcttac 


ttgattttta 


aaaact.catt 


tcgc:ttggta 


attttaaagg 


120 




• t g a a c 


t. t c atctat c 


ca.DCtgctta 


ta t at gtt ca 


ijaaaacaaat 


teatgyttge 


1 3 0 


tqaa. 


■tqttc 


tttaaaacct 


gaccagt tac 


aataactttt 


artgetttec 


taaaccargg 


2-3 0 


gt.aa, 


j a t a a a 


gcataaatca 


aagga t t_a; 


ggc t gagt t a 


•.aataagcac 


accaacagca 


300 


teat , 


iaatac 


aggcaggggt 


tataaag-cc 


at aaaggca t 


caattaa t ga 


at caa tqcta 


3o0 




jtaacc: 


a tgaaatcat 


aaatgetace 


act at gaccc 


ccagggt 1 1 1 


age t get 1 1 1 


4 20 


-ret. 


"tctcc 


tqgccactct 


gg:tttgtaa 


ct ct ctgagg 


atgattctgt 


ct t get acca 


4 : i0 


3*at' 


. 1 1 c t a 


tot 1 1 1 tege 


ctgtcqtcta 


gecaca agaa 


atatgttacc 


a Lacagaatt 


54 3 


it cataataa 


aqgtaggtat 


aaagaaggat 


agaaaa tct g 


tcaaca 




5 H 6 



•3 1 0 • 5 6 

Hll • 190 

H12 • PPT 

; 2 1 3 - H . Sapiens 

;oo • 5 6 

Lou Thr Asp Phe Leu Ser Phe Phe Tie Pro Thr Phe He Met He He 
15 10 15 

Leu Tyr Gly Asn He Phe Leu Val Ala Arg Arq Sin Ala Lys Lys He 

2 0 2 5 30 

Uu Asn Thr G]y Ser Lys Thr Glu Ser Ser Ser Glu Ser Tyr Lys Ala 
3 5 4 0 4 5 
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Arg Val Ala Arg Arg Gin Arg Lys Ala Ala Lys Thr Leu Gly Val Thr 

5 0 55^ 6 0 

Val Val Ala phe Met lie Ser Trp Leu Pro Tyr Ser lie Asd Ser Leu 
o5 70 75 ' 80 

lie Asp Ala Phe Met Gly Phe lie Thr Pro Ala Cys lie Tyr Glu lie 
85 90 95 

Cys Cys Trp Cys Ala Tyr Tyr Asn Ser Ala Met Asn Pro Leu lie Tyr 
100 105 110 

Ala Leu Phe Tyr Pro Trp Phe Arg Lys Ala lie Lys Val lie Val Thr 
115 12C 125 

G.^y Gin Val Leu Lys Asn Ser Ser Ala Tnr Met Asn Leu Phe Ser Glu 

130 135 140 

His lie Ala Val Gly Thr Lys Phe Arg He Pro Leu Lvs Leu P'ro Ser 
1*15 150 155 ' 160 

Glu Met Ser Phe Lys Ser Ser Lys Thr Met Asn Glu Gin He Asn Cys 
16b 170 175 

Ser Ser Asn Lys Gin lie Asn Val Phe Gin Ser Cys A^p Val 
180 185 190 



:::io 
:;: ii 
2 

213 



57 

976 

DNA 

H.Sdpiens 



400:- 57 

- ttgtggcaa ggagaccctg atcccggtct tcctgatcct tttcattgcc ctggtcgggc 60 

\ gqt <. : ggaaa cgggtttgtg ctotggctcc tgggcttccg catgcgcagg aacgccttct 120 

■_:t gt c-t acgt cctcagcctg gccggggccg act tcctct t cctctgcttc cagattataa 180 

.i t z gee tggt gtacctcagt aacttcttct gttcratctc catcaatttc c zz agct t c t 24 0 

* cac. actgt gatgacctgt gcetaccttg cagg :ctgag catgctgagc accgtcagca 300 
■-cgagcgctg cctgtccgtc ctgtggcoca tctggtatcg ctgccgccgc c:cagacacc 360 

* g:c.;gcggt cgtgtgtgtc ctgctctggg ccctgtccct actgctgagc atcttggaag 420 
gqaacttctg :ggcttctta tttagtgatg g-.ga :tctgg ttggtgtcag a;atttgatt 480 
tcat'-actgc agegtggetg atttttttat teat ^gttct ctgtggqtcc agtctggccc 540 
tgctggtcag gatcctctgt ggctccaggg gtctgccact gaccaggctg tacctgacca 600 
tcctgctcac agtgctggtg tccctcctct gcggcctgcc ctttggcat: cagtggttcc 660 
taat.-ttatg gatctggaag gattctgatg tcttattttg :ca;a:tcar. ecagtttcag 720 
ttgt'-ctgtc atctcttaac ageagtgeca accccatcat -.tacttctt gtqggctctt 73C 
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ttaggaagca gtgqcggstg cagcacccga Lcctcaagct cgctctccag agggctctgc 8-10 



aggacattgc tgaggtggat cacagtgaag gatgcttccg tcagggcacc cggagattca 
aaga-agcatt ctggtgtagg gatugacccc tctacttcca tcatatatat gtggctttga 
gaggcaactt tgcccc 



:2 10> 


5 8 


::: 11: ■ 


32 4 


:: l 2 


PRT 


:3 13- 


H .Sapiens 


'■' •() ■ 




: 2 2 1 : ■ 


UNSURE 




(266) . . (266) 




Xaa is Unknown 


•; jo:- 


58 



Cys Gly Lys Glu Thr Leu Tie Pro Vai Phe Leu lie Leu Phe He; Ala 
l" 5 10 15 

Lei Val Gl / Leu Val Gl/ Asn Gly Phe Val Leu Trp Leu Leu Gly Phe 

2 0 2 5 30 

Arg [let Arg Arg Asn Al -a Phe Ser Val Tyr Val Leu Ser Leu Ala Gly 
3 5 4 0 4 5 

A Li Asp Phe Leu Phe Leu Cys Phe Gin Tie He Asn Cys Leu Val Tyr 
50 5 5 60 

J.ej Ser Asn Phe Phe Cys Ser lie Ser Tie Asn Phe Pro Ser Phe Phe 
,i5 70 75 80 

Thr Thr V r: 1 Met Thr Gys Ala Tyr Leu Ala Gly Leu Ser Met Leu Ser 

3 5 90 9 5 

Tnr Val Ser Thr Glu Arg Cys Leu Ser Val Leu Trp Pro He Trp Tyr 
100 105 110 

Arg Gys Arg Arg Pro Arg His Leu Ser Ala Val Val Cys Val Leu Leu 
115 120 125 

Tro Ala Leu Ser Leu Leu Leu Ser He Leu Glu Gly Lys Phe Cys Gly 

130 135 14 0 

Pne Leu Phe Ser Asp Gly Asp Ser Gly Trp Cys Gin Thr Phe Asp Phe 
115 153 155 160 

He Thr Ala Ala Trp Leu He Phe Leu Phe Met Val Leu Cys Gly Ser 
165 HO 175 

Ser Leu Ala Leu Leu Val Arg He Leu Cys 31 y Ser Arg Gly Leu Pro 
180 18 5 190 

Lou Thr Arg Leu Tyr Leu Thr lie Leu Leu Thr Vai Leu Val Ser Leu 
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900 
9 60 
9 7 6 



W O 01/36473 



per a ;soo/3i5si 



195 



Leu Hys Gly Leu Pro Phe Gly lie Gin Trp Phe 



.10 



215 



Leu lie Leu Trc lie 

220 



rrp Lys Asp Ser Asp Val Leu Phe Cys His lie 

22 5 2 30 235 



His Pro Val Ser Val 

2 A 0 



Val Leu Ser 3e 



r Leu Asn Ser Ser Ala Asr. Pro I 
245 250 



lie lie Tyr Phe Phe 

255 



Val Gly Ser Phe Arg Lys Glr. Trp Arg Xaa Gin 



>60 



265 



His Pro lie Leu Lys 

270 



Leu Ala Leu Gin Arg Ala Leu Gin Asp lie Ala 



2 75 



280 



Glu Val Asp His Ser 
285 



Gl u (7 1. y Cys 
2 0 0 



Arg Gin Gly Thr Arg Arg Phe 

29 5 



Lys Glu Ala Phe Trp 
300 



Cys Arg Asp Gly Pro Leu Tyr Phe His His Tie 
-Ki5 310 315 

Gly Asn Phe Ala 



Tyr Val Ala Leu Arg 
320 



1 10 

2 11 



5 9 
5 7 8 
12:- ON A 
1 3: ■ H . Sap i ens 



: 0 0 : 



i :t ttcicatct 


cactgtt gag 


cagacagcct 


gc t gaaa g 1 1 


g 


: c gctgacc 


a c c a c a t a t a 


bO 


q-.aaraggtt 


a ;caaaggt g 


ttcagagcag 


cat aatggt c 


t 


•gaaacgat. 


gtaagc*: tea 


120 


trjgat ctgat 


r. ::caat gga 


aca act gat t 


gaaagcaggc 




Tagat tcga 


tcctgaatga 


1 r. 0 


orct caagat 


a t ggaagggt 


aaaaaaca t a 


cgt aaaa tgc 


a 


aggagtagc 


agaatggtta 


2 A 0 


g c c t: t c g t g c 


tt-.ctgetta 


aggcagctgt 


c.3gt 1 1 gcag 


t 


::ca tgggt c 


aaagtgt. gga 


300 


raa t cgt ggt 


atagcaaagt 


gtcactatca 


ccaaggggag 


g 


•agaaagta 


ct tgcagt ca 


3 60 


ihiatcaqqct 


gtacoact ta 


atagtattga 


gtt ca t ccga 


a 


zt ggtgagg 


t cgagac a gg 


A 20 


cr.qat ctqtt 


ggt -ctgt tg 


gtt gatgtga 


tcaagaaggt 


ca t eggaatg 


acagctacca 


A HO 


g- gaaat gat 


c.ca-7-dcca.ca 


gca ca ggc t a 


c aact gcaca 


t 


rgagttttg 


tgaatggaaa 


3 A 0 


aqcaqct.cat 


tgggtgaatg 


atcacacagt 


a gcggaag 








578 



2 1 0< 

:21 !:■ 
, 2 1 2 : > 
212- 



60 

192 

PRT 

H . Sapiens 



Aj(< 



60 
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Phe Arg Tyr Cys Va 1 He He His Pic Met Ser Cys Phe Ser He His 
1 5 10 15 

Lvs Thr Arg Cys Ala Vai Va 1 Ala Cys Ala Vai Val Trp He He Ser 
20 : >r 30 

Leu Val Ala Val He Pro Met Thr Phe Leu He Thr Ser Thr Asn Arg 
3 5 4 0 4 5 

Thr Asn Arg Ser Ala Cys Leu Asp Leu Thr Ser Ser Asp Giu Leu Asn 
50 55 60 

Thr He Lvs Trp Tyr Asn Leu He Leu Thr Ala Ser Thr Phe Cys Leu 

6 5 " 7 0 7 5 8 0 

Pro Leu Val He Val Thr Leu Cys Tvr Thr Thr lie He His Thr Leu 

8 5 ' 90 95 

Thr His Gl y Leu Gin Thr Asp Ser Cys Leu Lys Gin Lys Ala Arg Arg 
100 * U5 110 

Leu Thr He Leu Leu Leu Leu Ala Phe Tvr Val Cys Phe Leu Pro Phe 
115 120 125 

H;n He Leu Arg Val lie Gin Asp Arg He Ser Ala Cys Phe Gin Ser 
-.3 0 " 135 14 0 

Val Val Pro Leu Arg He Arg Ser Met Lys Leu Thr Ser Phe Leu Asp 

105 150 155 160 

H:y Tyr Ala Ala Leu Asn Thr Phe G...y Asn Leu Leu Leu Tyr Val Val 
165 HO 175 

Val Ser Asp Asn Phe Gin Gin Ala VH Cys Ser Thr Val Arg Cys Lys 
180 135 190 



apiens 



i c : 


61 


<3 11 :■ 


8 7 0 


<:? H :■ 


DNA 


<0H: 


H . S 


< -10 0:- 


61 



60 
100 



gggauggetc gtagacacac taaccctacc otttctgttt cttcctca~c tttcctttcc 

aretcTtttct catggtctcc tgtctgtctc tctctctctc ccctctttct ctctcctcgc 

tctttctcat: cccctccatt tctgtgtcaa tctcaatcca tttatatcgg tggccasttt 

tctatctctt tgttctatct ctctctctct ctctttccca ctttgtctct qcacqcctqL 2 4 0 

latg-.ttttc tgcctgtctc tctcttgccc tcatctctct gtctctctct tgccctcutc 3(J0 

tctrgtctc tctgtgtctg tgtctccccc gctcattccc atttgcaggt gcaatgtagc 360 

aggacaactc atggagcccc c:cgggcc:a tcqagtaccg gactggctga ccccctaggg 

ttggoagtag cccctgaccc tcagtatgje caacactacc ggagagcct.g aggaggtgag 
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cggcqctctg tccccaccgt ccgcatcagc ttargtgaag ctggtactgc tggqactga' 540 

vatg* gcgtc agc:rggcqg gtaaogcca: ottctccctg ctggtgctca aggagcgggc 69C 

^ctgcacaag gct;cttact acttoctgct ggacctgtgc ctggccgatg gcatacgctc 66C 

•;acci;tci:gc t:cccctttg tgctggcttc tgtgcgccac ggctcttcat ggacc-tcag 720 

igcactcagc tgcaagattg tggcctttat ggccgtgctc ttttgcttcc atgcggcctt 780 

■ :atg<jtg*:tc tgcatcagcg tcacccgcta catggccatc gcccaccacc gcttctacgc 840 

oaagcgcatg acactctgga catgcgcggc zg Q~I2 

10:- 62 

•■: 12:- PRT 

■ : 2 1 3 : ■ H.SaDie.is 



400 




62 




























I(/t 


Ala 


Asn 


Thr 


Thr 
5 


Gly 


Glu 


Pro 


Glu 


Glu 

10 


Val 


Ser 


Gly 


Ala 


Leu 
15 


Ser 


l.O 


i'ro 


Ser 


Al a 

2 0 


Ser 


Ala 


Tyr 


Val 


Lys 
25 


Leu 


Val 


Leu 


Leu 


Gly 

3 0 


Leu 


lie 


i(-:t 


Cys 


Val 
3 5 


o e r 


Leu 


Ala 


Gly 


Asn 
4 0 


Ala 


lie 


Leu 


Ser 


Leu 
4 5 


Leu 


Val 


Leu 


j'/S 


Glu 
"i.O 


Arg 


Ala 


Leu 


His 


Lys 
55 


Ala 


Pro 


Tyr 


Tyr 


Phe 
60 


Leu 


Leu 


Asp 


Leu 


ys 


Leu 


Ala 


Asp 


Gly 


lie 
70 


Arg 


Ser 


Ala 


Val 


Cys 

75 


Phe 


Pro 


Phe 


Val 


Leu 

80 


il a 


Se r 


Val 


Arg 


His 

85 


Gly 


Ser 


Ser 


Trp 


Thr 

90 


Phe 


Ser 


Ala 


Leu 


Ser 
95 


Cys 




7 le 


Val 


Ala 
1 0 0 


Phe 


Met 


A La 


Val 


Leu 
105 


Phe 


Cys 


Phe 


His 


Ala 
110 


Ala 


Phe 


]t :t 


Leu 


Phe 
1 15 


Oys 


Tie 


Ser 


Val 


Thr 

120 


Arg 


Tyr 


Met 


Ala 


lie 
125 


Ala 


His 


His 




Phe 

: 30 


Tyr 


Ala 


Lys 


Arg 


Met 
L35 


Thr 


Leu 


Trp 


Thr 


Cys 
140 


Ala 


Ala 


Glu 




JlO 

212 
2 13 




63 

9n2 

DNA 

H . oapien 


























4 00 - 6 3 
iiaa^ttgc: 


gtac 


tgaact a 


ttga 


atgga acttggaaat 


aaagt ccct t 


c c a a c 


at aac 


at t 




cagagagtaa taggtaaat 


g ttttaqaagt 


gagaggac t c 


a a a 1 1 


gecaa 
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tgatttactc ttttattttt cctcctaggt ttctgggata agt.atgt.gca aataaaaaaL 1 f 0 

aaacatgaga aggaactgta aoctgattat ggatttggga aaaagataaa t caa cacaca 2<10 

:ttttgcc acaatataat 3 (' ( 

;gc:tccc tgtacagttt 3ts" 

igtta-tg tttccatatc 4 Si 

;tcca:qq ccactgtgga 4H 



b(..i 
6t.i 
7;. i 

9C" 
9>v. 
96; 



aagggaaaag 


taaactgatt 


ga :agcc 




aggaatg a t g 


Laat at t tec 


tgtgtg a aaa 


acaactggtc 


aaatgatgtc 


aat ggt get c 


at aat t ct ga 


ccacact 


-gt 


t ggcaa t rt g 


acact t caaa 


caact tcata 


c c ;caac 


3 a a 


tt ggctcat t 


ctttcttctg 


gggtgtctgg 


teatgee 


1 1 a 


cagtatg gtg 


gtat tt t gga 


gaagtcttct 


gtaaaa: 


tea 


cacaagcacc 


cccattttc 


catttglct: 


tcatctc 


;at 


t g a c c g c t a c 


gaga t a t aaa 


gc:aagatga 


at atct: 


3 gt 


latttgL gt g 


tgtccct get 


gtttr. t gca z 


Lt ggaa*. 


gat 


ctttctggag 


.-igagatatat 


ta ;aaa :atg 


ttcaotg 


;ag 


aggaggttgc 


atctggggta 


ctgaccttta 


t gact tc 


i tt 


ttatatacct 


at tacaga 


a La tat :tra 


tegctaa ica 


acaqgeaaga 



: J10 
:21 3 



64 
2 33 
PRT 

H . Sapiens 



:-100:- 64 

Arg Glu Lvs Thr Asp Gin Pro Ser Gly Met Met Fro Phe Cys His Asn 
■i * b 10 15 

lie lie Asn lie Ser Cys Val Lys Asn Asn Trp Ser Asn Asp Val Arg 

20 25 30 

A. a Ser Leu Tyr Ser Leu Met Val Leu lie lie Leu Thr Thr Lou Val 
3 3 4 0 4b 

Gly Asn Leu lie Val lie Val Ser lie Ser His Phe Lys Gin Leu His 

S3 b b i : y 0 

Tnr Pro Thr Asn Trp Leu lie His Ser Met Ala Thr Val Asp Phe Leu 

6 3 7 0 lb 8 0 

Leu Gly Cys Leu Val Met Pro Tyr Ser Met Val Arg Ser Ala Glu His 

8 5 90 9 3 

Cys Trp Tyr Phe Gly Glu Val Phe Cys Lys Tie His Thr Ser Thr Asp 
a U 0 10 5 U0 
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lie Met Leu Ser Ser Ala Ser lie Phe His Leu Ser Phe He Ser He 

115 120 12b 

Asp Arg Tyr Tyr Ala Val Cys Asp Pro Leu Arg Tyr Lys Ala Lys Met 

130 135 I AO 

Asn He Leu Val He Cys Val Met He Phe He Ser Trp Ser Val Pro 

1-4 5 150 155 160 

Ala Val Phe Ala Pne Gly Met He Phe Leu Giu Leu Asn Phe Lys Gly 

165 170 H5 

Ala Glu Glu He Tvr Tvr Lys His Val His Cys Arq Gly Gly Cys Ser 

180 185 190 

Val Phe Phe Ser Lys He Ser Gly Val Leu Thr Phe Met Thr Ser Phe 

195 200 20 5 

Tvr lie Pro Gly Ser He Met Leu Cys Val Tyr Tvr Arg lie Tyr Leu 

110 215 220 

He .Ha Lys Glu Gin Ala Arg Leu He Ser Asp Ala Asn Gin 

225 230 235 

■::U0 ■ 6 5 
211 ■ 1018 
2 12 • DNA 
213 H . Sapiens 

:-100 ■ 65 

.iacd':tcc:g ggtggaacct gggcatgtat a'rtttgattg ttttatgcat actcctagtg 60 

aaga,.-.ccaa t gtcttgctca gatagaagca agatactcag acttagtttc tctgtagotc 120 

:tgct.ttt-.a ttattcctgg ttggattgca ccactactca gtttctattt tataataot.g lfsO 

.iUaraaaac atgggaggga aataactttg tattggtttt tatggataat ttattatgtg 240 

:.cct.-gac-.c tggccttgtc aaaagaagga cgtaagaagg ca^gatgtat tatacttggg 300 

aatgatagaa gagactgacc tgqtatttcc acccggaaga gggaaaggat tttaactaea 3t>0 

lataoagg aa tccagcagat ggcatcagag aacactataa aaaagaaacg atttgcaaca '110 

jcca.xtcrc t'.ccaaaaca attccttact tctgtggtct gcaagqcggt tttttgaatq Ai- 0 

■jaao.gaaca tagtaatata ggaaaazaca atgatgagaa aagccagcaa gtt :a:a :ct 50(1 

gtig-iggaaa ag<:acacttt taacat:tca ggegtaaaag tcaacagtaa aatcactgtg 600 

gtac.iqgtrg agtatccctt acceaaaatg tttgaaacca gaaatgrt.tt ggavttcgga 660 

rttC'igaa:a tttacacatt cataatgata tatcttggaa atggtt--cca a at-:: t 3 a aca 720 

:aaa itttat ttatgtttca tatacacctt ar_aca;atag tctgaaagta attitgtaca 78 0 

jtat'.ttaaa taattttggg catgaaacaa agtttgcata caLtgaacca t:aga:agca 8-10 



aaagjttcag gtgtggaatt ttccacttgt ggcatcatgt tgatgctcaa aaa^t.tcc : 
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attttagagc atttcaaatt ttqgattttc aaattacaaa tgcttaacct gt acttagat 
gttaaata^a gtgcctcttc cacgggcact ttcaggaagc atLCLtttat ataugccc 

<710> 6 6 

■::>Il:- 327 

-.212 • PRT 

• : 1 3 : • H.Sapiens 

• : -I 0 0 : ■ 6 6 

T^r Tie Lys Glu Cys Phe Leu Lvs Val Pro Vai Glu Glu Ala Leu Tyr 
L"' 5 10 15 

L<ju Thr .it-r Lys Tyr Arg Leu Se r lie Cys Asr: Leu Lys 1 .1 e Gin Am 

2 0 2 5 3 0 

Leu Lys Cys Ger lys lie Trp Am Phe Leu Ser lie Asr. Met Mot Pro 
3 S 4 C -4 5 

Cm Val Glu Asn Ger Thr Pro G] u Ala Phe Ala Val Trp Phe Asn v c a 
SO 55 60 

Cys Lys Leu Cys Phe Mel Pro Lvs He lie Asn lie Val Gin Asn Vvr 

7 0 7 5 HO 

Phe Gin Thr Met Cys lie Arq Cvs Tie Asn lie Asn Lys Phe Cys Val 
3S 90 95 

Thr Trp Glu Pro Phe Pro Arg Tvr Tie lie Met Asn Val lie Phe Arg 
100 " 105 110 

Asn Pro Lvs Ger Lvs Thr Phe Leu Val Ser Asn lie Leu Gly Lys Giy 
115 " 120 125 

?vr Ger Thr Cys Thr Thr Val 1 .1 e Leu Leu Leu Thr Phe Thr Pro Glu 
130 135 140 

Met Leu Lys Val Cvs Phe Ger Pro Thr Gly Val Asn Leu Leu Ala Phe 
145 150 155 IbO 

Leu lit* Tie Val Phe Ser "yr lie Thr Met Phe Cys Ser lie Gin Lys 
165 170 175 

Thr Ala Leu Gin Thr Thr Glu Val Arg Asn Cys Phe Gly Arg Glu Val 
180 185 190 

Ala Val Ala Asn Arg Phe Phe Phe He Val Phe Ser Asp Ala He Cys 
195 200 20!, 

Trp He Pro Val Phe Val Val Lys He Leu Ger Leu Phe Arg Val Glu 
210 215 220 

Tie Pro Gly 31 a Ser Leu Leu Ser Phe Pro Ser He He His Arg A La 
725 230 23S ?40 

Phe Leu Arg Pro Ger Phe Asp Lys Ala Arg Val Asp Thr He He His 
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25 5 



Lys Asn Gin Tyr Lys Val He Ser Leu Pro Cys 
260 265 



Fhe He lie Ser lie 

270 



He Lys Lys Leu Ser Ser Gly Ala lie Gin Pro 
2^5 280 



:l.y lie He Lys Ser 
285 



Arg Ser Tvr Arg Glu Thr Lys Ser Glu Tyr Leu 
29C J 295 



Ala Ser He Ala Arg 

300 



His Trp Phe Phe Thr Arg Ser Met His Lys Thr 
3 ! ) 5 310 315 



He Lys He Tyr Met 

320 



Pro Arg Phe His Pro Gly Leu 
325 



: 2 1 0 
: :: 1 1 



: J13 
MOO 



1251 
DMA 

H. Sapiens 



.ictaccatqg 


aagctgacct 


gggtgcca ct 


ggccacaggc 


cccgcacaga 


get tgatgat 


6C 


gaggactcct 


acccccaagg 


tggct gggac 


acggtcttcc 


t g g t g g c c c t 


get get oct t 


1 2 ■: 


q.jgcT.gcc^g 


ccaatgggtt 


galggcgtgg 


ctggccggct 


eccaggcccg 


gcat ggagct 


1 P. 0 


M'icacgcgf.c 


tggegctget 


cot get cage 


ctqgccctct 


ctoact t c tt 


gtt cctggca 


2 -i 


goagoggcct 


tccagatcct 


agagatcegg 


ca tgggggac 


act ggccgct 


qgggacagct 


3u«: 


g. ;t(icc- : j:t 


tctactactt 


cctatggggc 


gtgtcctact 


cctccggcct 


-ttcctgctg 


3 • ; -o 




gectcqa :cg 


ctgect gc t g 


gcgetgt gee 


ea cactggt a 


-cctgggcac 




■ :< J :ccagt cc 


gcctgcccct 


etgggtctgc 


gee ggtgtct 


gggtgctgge 


:acactct t c 




-gt.gccct 


ggctgqt -tt 


ccccgaggc t 


geegt ctggt 


ggt acgacct 


ggt ca t ctgc 




ijt ggacr.ct 


gggacag :ga 


ggagctgtcg 


ctgagga t gc 


tggaggtcct 


ggggggcttc 




otgccttt cc 


tcctgctgct 


egtet gceae 


gtgctcaccc 


aggccacagc 


rtgtcgcacc 




■ .accaccgcc 


aacagcagcc 


cgeagectge 


eggggctteg 


cccgtgtggc 


ragga cca 1 1 




ct gtragoot 


atgtggtcct 


gaggctgccc 


t aceagctgg 


circagctgct 


otacctggcc 




r r ,;c« gtgqg 


acg t 'Ttaci. c 


tggct ace tg 


ctctgggagg 


c.:ctggtcta 


:t>";cgacLac 


i. ■* 0 


: :t ga t cctac 


t caaoagct g 


cctcaqcccc 


t t octet gec 


teatggeeag 


tgocgacctc 


( :1 (. ■ 0 


oggaccctgc 


t gcg :tccgt 


get ctegtee 


ttcgcggcag 


ctetctqega 


ggagcgqccg 




gg :<agct tea 


cgcccactga 


gccacaqacc 


cagetaga t t 


ctgagggtcc 


aaet ct gee a 


10 2 0 


■ j^gecgatge 


caqaggcerd 


gtcacagatg 


gatcccgtgg 


coca g cctca 


ggt qaacccc 


1 0 fi 0 
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acactcca^c cacgatogga tcccacagct cagccacagc tgaaccctac ggcccagcca 1140 
cagtcggatc ccacagccca gccacagctg aacctcatgg cccagocaca gtcagattct 1200 
grggcccagc cacaggcaga cactaacgLc cagacccctq cacctigctgc c 1251 



LL10 
■211 
1 2 
::":13 



6o 
4 1 7 
PP.T 

H. Sapiens 



S00:- 68 

:'hr Thr Ket Glu Ala Asp Leu Gly Ala Thr Gly His Arg Pre Arg Thr 



1 



10 



15 



'Liu Leu Asp Asp Glu Asp. Ser Tyr Pro Gin Gly Gly Trp Asp Thr Val 



Pho Leu '/a 1 Ala Leu Leu Lou Leu Gly Leu Pre Ala Asn Gly Leu Met 



4 0 



45 



la rp Leu Ala Gly Ser Gin Ala Arg His Gly Ala Gly Thr Arg Leu 
50 55 60 

la Leu Leu Lee Leu 3er Leu Ala Leu Ser Asp ?he Leu Phe Leu Ala 



A. a 7,1a Ala Phe Gin He Leu Glu Tie Arg His Glv Gly His Trp Pro 

9 5 10 95 

Leu Gly Thr Ala Ala Gys Arg Phe Tyr Tyr Phe Leu Trp Gly Val Ser 



10 5 



11C 



Ser Gly Leu Phe Leu Leu Ala Ala Leu Ser Leu Asp Arg Cys 



120 



12 5 



Leu Leu Ala Leu Cys Pro His Trp Tyr Pro Gly His Arg Pre Val Arc 
■30 135 140 

Leu Pro Leu Tip Val Gys Ala Gly Val Trp Val Leu Ala Thr Leu Phe 



145 



155 



1 60 



■rr Val Pro Trp Leu Val Phe Pro Glu Ala Ala Val Trp Trp Tyr Asp 



165 



170 



■:u Val He Cys Leu Asp Phe Trp Asp Ser Glu Glu Leu Sei Leu Arg 



18 0 



18 5 



1 90 



rt. Leu GJu Val Leu Gly Gly Phe Leu Pro Phe Leu Leu Leu Leu Val 

19 5 ' 2 0 0 ?. 0 5 

,'s His Val Leu Thr Gin Ala Thr Ala Cys Arg Thr Cys His Arg Gin 

210 215 220 

l.n Gin Pro Ala Ala Cys Arg Gly Phe Ala Arg Val Ala Arg Thr He 



2 30 



2 35 
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Leu Ser Ala Tyr Val Val Leu Arg Leu Pro Tyr Gin Leu Ala G^n Leu 
245 250 255 

Leu Tyr Leu Ala Phe Leu Trp Asp Val Tyr Ser Giy Tyr Leu Leu Trp 

260 265 270 

GLu Ala Leu Val Tyr Ser Asp Tyr Leu lie Leu Leu Asn Ser Cys Leu 

275 280 285 

Ser Pro Phe Leu Cys Leu Met Ala Ser Ala Asp Leu Arg Thr Leu Leu 

290 295 300 

Arg Ser Val Leu Ser Ser Phe Ala Ala Ala Leu Cys Glu Glu Arg Pro 
305 310 315 320 

G.y Ser Phe Thr Pro Thr Glu Pro Gin Thr Gin Leu Asp Ser Glu Gly 
325 330 335 

Pro Thr Leu Pro Glu Pro Met Ala Glu Ala Gin Ser Gin Met Asp Pro 
340 345 350 

Val Ala Gin Pro Gin Val Asn Pro Thr Leu Gin Pro Arg Ser Asp Pro 

355 360 365 

Thr Ala Gin Pro Gin Leu Asn Pro Thr Ala Gin Pro Gin Ser Asp Pro 

370 375 380 

Thr Ala Gin Pro Gin Leu Asn Leu Met Ala Gin Pro Gin Ser Asp Ser 

3H5 3 90 3 95 4 00 

Val Ala Gin Pro Gin Ala Asp Thr Asn Val Gin Thr Pro Ala Pro Ala 
■105 410 415 



:2 10 
:2 1 1 
:212 
:213 



69 

659 

DMA 

H . Sapiens 



■'400> 69 

tacaggcctg agcatgctgg gctccatcag caccaagca.; tgcctgtcca tcctgtggcc 60 

eatctagtac cgctgecacc accccacaca cctgtcagca gtcgtgtgtc ctgctctggg 120 

oci^tgtccct gctgcagagc atcctggaat ggatgttctg tggcttcctg tctagtggtg 180 

■vtgattctgt ttggtgtgaa acatcagatt tcatcacag: cacatggctg atttttttat 2 4 J 

gtgtggttct otgcgggtcc agcccggttc tgctggtcag gatcctttgt ggatcccgga 300 

ciqatgccctt gaccaggctg tacatgaoca tcctgctcag agtgctggtc ttcctcctct 36 J 

ntganctgcc ctttggcatt cagtgattcc :atttttctg qatccacqtg gatttgtcac 42 J 

■Tttcgtctag tttccatttt cct.gtccact cttaacagca gtgccaa::cc cattatttac 480 

ttcttcatgg gctcctttag gcagcttcaa aacaggaaga ctctctagot ggttctccag 540 
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agggctctgc aqqacacqcc *_gaggtggaa gaaggcagat 
Ciggagctgt cutgaagcag ar.tggggeea tgaggaagag 



ggcggctttc tgaggaaacc 600 
cctcLgccct gtcagrcag 659 



210 
211 



213 
•;00: 7 



7 0 
2 13 
PRT 

H . Sapiens 



Tyr Arg Pro Glu His Ala Gly Leu His Gin His 



Gin Ala Leu Pro Val 



His I to Val Ala His Leu Val Pro Leu Pro Pro 

20 2 1 : 

:;<--r ;"er Arq Val Ser Cys Ser Gly Pro Cys Pro 

Trp Asn ("; 1 y Cvs Ser Val Ala Ser Cy~- Lou Val 

:c ' 55 



His Thr Pro Val 

30 

Cys Cys Arg Ala Sor 



Val Lou lie Leu Phe 



Yal T.y, H] 



lie Ser Ser Gin Ser His Gly 



Phe Phe Tyr Val 

8 0 



Trp Phe Ser Ala Gly Pro Ala Arg Phe Cys Trp 



8 5 



90 



\sp Pro Gly Ara Cys Pro Pro Giy Cys Thr Pro 
10 0 105 

"rp :er Ser Ser Set Val Thr Cys Pro Leu Ala 



G,.y Ser Phe Val 
95 

Cvs Ser Glu Cys 
110 

Ser Asp Ser Tyr 
12 5 



Ser Gly 3^ r Thr Trp He Cys His Val Arg 
'.3 0 1 3 5 



Leu 
140 



Sou Ser Thr Leu Asn Ser Ser Ala Asn Pro He lie 



150 



155 



Gly Ser Phe Arg Gin Leu Gin Asn Arg Lys Thr 
165 1"'0 



Val Ser He Phe 



T'/r Phe Phe Met 
160 



Leu Val Leu Gin 

1 7 5 



Arg /-la Leu Gin Asp Thr Pro GLu Val G l u Glu Gly 



er Glu Glu Thr Leu Glu Leu Ser Ser Arq Leu Gly 

195 200 



Arg Trp Arq Leu 
190 

Pro Giy Arg Ala 
205 



ic;r Ala Leu Ser Val 
,■10 



:.2 10 ■ 7 2 

2 11 5 5 9 

:2 12 ■ DtJA 

;21 3 ■ H .Sapiens 
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<400> 71 

dtqccqaag] caggccgcag aagagaagag gaggaoggtg aggaggatga gcccagggaa 



60 



gccccggggt gggggccgct gggggcctcg ctccacccgc agcagcagca taaggctgg: 120 

c-:--ar.aca , i'j gtgcaaca:a gcagaq:cag ca gcaccgct gccaccagcc aiagcgtccg 1R0 

gcacaagtgg cggctgggct ccccgaagaa ctggg*:gcag gcgccgctga gcagcaggtg 210 

■jugcc-gcagg cagagggccc aggtgagggc gcaca>:acag gtggtcaggt ggcgtgggcg 300 

gcggcacgag taccaggctg ggaagagggc ggccaggcac tgctccacgc tgacggc:.gc 360 

'Zagqiigacz-z aggcccacga tgtagcagaa gaagcgcagc gttgccaggc tggtctg :ac 420 

gaagnccggg aagtccagcc ggccttgcag caagtcgggg acgatggcca ccatgtg jca 480 

gccaaggaaq atgagatccg cgcaggccac gtccaggagg tagatggcga aaggg*: z t ct 540 

c|i.ag«cattq gagctgagc 559 



:210 
211 
212 
212 

4 00: 



12 
2 11 
PRT 

H Sapiens 



Lou Scr Ser Asn Val Tyr Arg Asn Pro Phe Ala lie Tyr Leu Leu Asp 

5 10 15 

Val Ala Cys Ala Asp Leu lie Phe Leu Giy Cys His Met Val Ala He 

20 25 30 

Fro A:.-;p Leu Leu Gin Gly Arg Leu Asp Phe Pro Gly Phe Val Gin 
3!'.' 4 0^ 45 

7hr Ser Leu Ala Thr Leu Arg Phe Phe Cys Tyr Tie Val Gly Leu oer 
50 55 60 

Leu Leu A] a Ala Val Ser Val Glu Gin Cys Leu Ala Ala Leu Phe Pro 
t. r - 70 75 80 

Ala Trp Tyr Ser Cys Arg Arg Pro Arg His Leu Thr Thr Cys Val Cys 
8 5 90 9 5 

Al a Leu Thr Trp Ala Leu Cys Leu Leu Leu His Leu Thr Thr Cys Val 
100 105 110 

■Cys Ala Leu Thr Trp Ala Leu Cys Leu Leu Leu His Leu Leu Leu Scr 
1 j 5 12 0 12 5 

(_-: y Ala Cys Thr Leu Leu Leu Ser Gly Ala Cys Thr Gin Phe Phe Glv 
130 135 140 

Glu Pro Ser Arg His Leu Cys Arg Thr Leu Trp Leu Val Ala Ala Va 
14 5 150 155 160 
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Leu Leu Ala Leu Leu Cys Cys Thr Met Cys Gly 
165 170 



Ala Ser Leu Met Leu 
175 



Leu Leu Arg Va 1 Glu Arcj ^ly Pro Gin Arg Pro 
180 135 



Pro Pro Arg Gly Phe 

190 



Gly Leu lie Leu Leu Thr Val Leu Leu Phe 
195 200 



Ser Ser Ala Ala Cys 
205 



Leu Arg His 

1 0 

: o ■ 73 
02 11 ■ 1008 
12 ■ DNA 
1 3 H . Sapiens 



TO 



7 3 



, 1 1 gq. 


j a z e a t 


ctttcrcatL 


tggagtgiitc 


ct tgctqtcc 


tgijcc.ccct 


cat cattget 


60 


a eta, 


icacuc 


tagtggctgt 


ggcrgtg ;tg 


ctgt t ga t cc 


acaagaatga 


tggtgtcagt 


1 :■ 0 


■ :t.-:t 


icttca 


ccttgaat ct 


ggctgtggct 


gacaccttga 


t tqgtgtggc 


cat ctctggc 


1 5 0 


ct uci 


;acdq 


accagctct e 


.rage :ct t ct 


egge cca cac 


agaagaccct 


gtgcagcctg 


2 ,; -"i 


oqga- 


ggcat 


1 1 q t e a c 1 1 c 


ct :cgcagct 


gcctctgtcc 


t cacgg teat 


gctgat :acc 


■* ! 


tt.tg. 


3 jaggt 


aect t qccat 


caag lagccc 


tt ccgctact 


t ga agate at 


gaqtgggt tc 


:•• '• 




-gggg 


c:tgcattgc 


cgggctgtgg 


t tagtgt ctt 


ac ;tca t cgq 


ct t cc *_ ccca 


4.; ;= 




ia dt':c 


c:atgtt:cca 


gc=sg act gee 


t acaaagggc 


agtgcagct t 


c 1 1 1 gctg t a 


a i- :■ 


tt. tc 




a -ttegtgct 


ga-e-tctcc 


tgcg-tggct 


tc::cccagc 


cat. get :ctc 




tt tg< 


::tct 


tct ac t gega 


ca tg etc nag 


attgect cca 


tg :acagcca 


gcagattcga 


v, ■>■■■ 


,if.qa' 


gguuc 


atgeaggage 


catg get gga 


ggttatcgat 


cc teaeggac 


tcccagcgac 




tt ca. 


.agece 


t ;cgtactgt 


gt-tgtt ;tc 


attgggagct 


ttget ctatc 


ctggaccccc 


1.: ■:• 


t..cc* 


r a t 0 a 


ctqq:at.:gt. 


geaggtg gec 


tgocaggagt 


gt cacctctu 


cctagr get g 




q-.:ac' 


Tgtacc 


tgtggetgct 


eggegtg ggc 


aactccct gc 


t c lacocact 


catctatgcc 


8 4 0 


Kcit t- 


jgcaga 


aggaggtgcg 


actgeaget c 


taccaoa t gg 


ccc:aggagt 


gaagaagg t g 


900 


ot ca 


■ -teat 


iccLcctctt 


tctctcggoc 


aggaattgtg 


geccagagag 


gcccaqgga a 


9t.O 


• Kit t- 


■ctgtc 


acat cgtcac 


t a t ct ccago 


t c a g a g 1 1 1 g 


atggctdd 




1 0 0 B 


:::10 
:;n.i 


74 
• 335 
PRT 















'.13 
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< 4 0 0 > 7 4 

Ket Glu Ser Ser Phe Ser Phe Glv Val lie Leu Ala Val Leu Ala Ser 
1 5 10 15 

Leu Tie lie Ala Thr Asn Thr Leu Val Ala Val Ala Val Leu Leu Leu 

2 0 2 5 30 

He His Lys Asn Asp Gly Val Ser Leu Cys Phe Thr Leu Asn Leu Ala 
35 40 45 

Val Ala Asp Thr Leu He Gly Val Ala He Ser Gly Leu Leu Thr Asp 

50 55 60 

Gin Leu Ser Ser Pro Ser Arg Pro Thr Gin Lys Thr Leu Cys Ser Leu 
6 5 7 0 7 5 8 0 

Arg Met Ala Phe Val Thr Ser Ser Ala Ala Ala Ser Val Leu Thr Val 
85 90 95 

Met Leu He Thr Phe Asp Arg Tyr Leu Ala He Lys Gin Pro Phe Arg 
100 105 110 

Tyr Leu Lys He Met Ser Gly Phe Val Ala Gly Ala Cys He Ala Gly 
115 120 125 

Leu Trp Leu Val Ser Tyr Leu He Gly Phe Leu Pro Leu Gly He Pro 
130 135 140 

Met Phe Gin Gin Thr Ala Tyr Lys Gly Gin Cys Ser Phe Phe: Ala Val 
145 150 155 160 

Phe His Pro His Phe Val Leu Thr Leu Ser Cys Val Gly Phe Phe Pro 
165 170 175 

Ala Met Leu Leu Phe Val Phe Phe Tyr Cys Asp Met Leu Lys lie Ala 
180 18 5 190 

Ser Met His Ser Gin Gin He Arg Lys Met Glu His Ala Gly Ala Mo: 

195 200 205 

Ala Gly Gly Tyr Arg Ser Pro Arg Thr Pro Ser Asp Phe Lys Ala Leu 
I- 1 0 215 2 20 

Arq Val Ser Val Leu He Gly Ser Phe Ala Leu Ser Trp Thr Pro 

225 230 235 24 0 

Phe Levi lie Thr Gly lie Val Gin Val Ala Cys Gin Glu Cys His Leu 
245 25 0 255 

Tyr Leu Val Leu Glu Arg Tyr Leu Trp Leu Leu Gly Val Gly Asn Ser 

2 60 ' 2 65 27 0 

Leu Leu Asn Pro Leu He Tyr Ala Tyr Trp Gin Lys Glu Val Arg Leu 
275 280 285 

Gin leu Tyr His Met Ala Leu Gly Val Lys Lys Val Leu Thr Ser Phe 
290 295 ' " 300 
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Leu Leu Phe Lou Ser Ala Arg Asn 

3 0 5 310 


Cys Gly Pro 
315 


Glu Arq Pro Arg Glu 

320 




Scr Scr Cy 


s His lie V 
325 


a 1 Th t- T c± 


Ser Ser Ser 
3 30 


Glu Phe Asp G ] y 
3 3 5 




:210:> 7 5 

:211> 2137 
•:212> DMA 
■ .2 1 3;- H . Sapiens 












■'A 00:- 75 
aactggaagg 


gcagccgtct 


g ;cgcccacg 


aaca:cttct 


caagcartt 


gagtgaccac 


>'•> ") 


ggct t:gcaag 


ctggtggct g 


gecce^cgag 


t c ^eggget c 


tgaggcaegg 


ccg t ega c 1 1 


12') 


a a gcg t tg za 


tcctgtt ace 


t ggaga c::t 


c t g a g e t c t c 


acctgetact 


t et geeget g 


1 -";u 


cttct.g c a e a 


gagecegg g z 


gaggacccc t 


ccaggatg:a 


ggtcccgaa: 


ageaeegg cc 


2 -M 


cgqaca acgc 


gaegetge 3-3 


aigctgcgga 


ac::ggcgat 


eg eg gtggc: 


tgecegt gg 


3 I HI 


t gtact „ get 


ggtggcgg-g 


g-.:agoai::c 


eggg :aacct 


;ttctctctg 


t'-f'^t gcegt 


3 n l ! 


g:; eg gcg- at 


gggg-Odga 


t — ecgt -gg 


teat ;ttC3t 


gate a acetg 


age gt raegg 


A . 1 


a :ctga: get 


ggccagcgt g 


trgco-ttec 


aaat :taci:a 






■■; .' ■ 1 


gggtattegg 


ggtgctgctt 


tgeaa -g -gg 


t gae zgtq-jc 


2 1 1 1 1 a "gc a 


a acatgta 1 1 


5 3 f 1 


^ :agcaL cc t 


cacc atgac ^ 


tgtat ragcg 


tggageget t 


: zt 9'5 , 3 i 3'3 ^ 


e t gtacccgc 


'[.' 


r .:dgct:c3a 


gcgcrggcgc 


egcegteg-t 


acgcggtggc 


c gege gt gca 


gggacet gge 


*J '.M.I 




gaeege :ctg 


tccccgcegg 


cgege acega 




cnggtgcacg 


7 20 


■:cc:cgg:.a: 


catcacctg-: 


ttcgaegt-c 


t :aagtggsc 


gatgete jcc 


a g cgt g gee a 


7 ( ■ 


-g -gggcrgt 


qttc:t ; 


accat c: t ea 


t :ctg.;tgt: 


ccr.cat cocg 


f.cgtgatca 


8 A 0 


■;cgrag,::.:g 


1 1 a ca egg cc 


accatcctca 


agctgttgcg 


racg gaggag 


g egeaeggee 


9 1" ( 


ggg.igcagcg 


gaggegegeg 


g-gggectgg 


cegcggtggt 


it tgctggee 


tttgtcacct 


!>»'-(: 


gctrcgcccc 


ca a caaot *_ e 


gtge'.ectgg 


cgeaeat cgt 


ga geegee-.g 


1 1 ct aeggca 


102( 


a gage tact, a 


ccacgtgtdc 


aagc:cacgc 


t gegtct cag 


et gcet ca a e 


aactgtctgg 


1 OtO 


accegt-rgt 


t ta-.-.,3 ;t:: 


gcgtcecggg 


aa*:t :eaget 


gegectgegg 


gaatatt-gg 


1 14 0 


gcegc eg :jg 


ggtgcc :aga 


gacaccctgg 


acacgegceg 


egagagee \c 


t *:ct cegeca 




ggacc aegtc 


cgtgcg;tcc 


gaggceggig 


cgcaccctga 


agggatggag 


ggageea ccd 


1 2: " 1 C 


ggee egg cct 


ccagag gcag 


gagagegtgt 


t ct gagt cee 


gggggcg :ag 


ct_ t ggaga gc 


1 3 2 ( : 


cgggcgcgca 


get r ggagga 


tecaggggeg 


catggagagg 


reaeggrg -c 


agaggt tcag 


133 0 


ggcigaacagc 


tgcgttgcte 


ccaggcactg 


cagaggcccg 
Page 


gtgggga agg 

A8 


gtct ecaqge 


1 4 AO 
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t ttattcct c 


ccaggcactc 


cagaggca :c 


ggtgaggaag 


ggtctc^agg 


cttcact cag 


1500 


ggta^jagaaa 


caagcaaagc 


ccagcagc :jc 


acaggg t get 


tgttatrctg 


cagaggg tgc 


15 60 


ctct gcctct 


ctgt gt cagg 


ggacagct t g 


tgtcaccacg 


cccggctaat 


ttt tgtattt 


162 0 


itctragtag 


agctgggctg 


t cacccccga 


get cct t aga 


cactcctcac 


acct giccat 


1680 


acccgaggat 


ggat at t caa 


ccagccccac 


cgcctacccg 


ac teggt 1 1 c 


t ggatatcct 


1740 


ctgtgggcga 


act gcgagcc 


ccattcccag 


ctcttctccc 


tgetgacate 


gtcccttagc 


1800 


■ icacctgtcc 


atacccgagq 


a t ggata 1 1 c 


aaccagcccc 


accgccracc 


egacteggt t 


18 60 


ictgqatatc 


ct ct gt gggc 


gaac tgcgag 


ccccattccc 


a g ctct t etc 


ccr.gctgaca 


192 0 


-.cgtccctta 


gtt gtggttc 


tggccttctc 


cat tctcctc 


caggggt t ct 


ggtctccgta 


198 0 


gixctjgtgca 


cgccgaaatt 


tctgtttatt 


t cactcaggg 


gcactgtggt 


tgctgtggtt 


2 04 0 


ggaattcttc 


1 1 1 cagagga 


gcgcctgggg 


ctcctgcaag 


tcagctactc 


tccgtqccca 


2100 


-.;*; t cccctca 


cacacacacc 


cccctcgtgc 


cgaattc 






2137 



•:210> 76 

■:2 1i> 35 9 

-:::i2> PRT 

•2!13> H. Sapiens 

■:A00> 16 

Met Cln Val Pro Asn Ser Thr Gly Pro Asp Asn Ala Thr Leu Gin Met 
15 10 15 

Leu Arg Asn Pro A] a lie Ala Val Ala Leu Pro Val Val Tyr Ser Leu 

2 0 2 5 3 0 

Val Ala Ala Val Ser lie Pro Gly Asn Leu Phe Ser Leu Trp Val Leu 
35 40 45 

Cys Arg Arg Met Gly Pro Arg Ser Pro Ser Val Tie Phe Met lie Asn 
50 55 60 

Leu Ser Val Thr Asp Leu Met Leu Ala Ser Val Leu Pro Phe Gin Tie 
*,5 7 0 7 5 8 0 

Tyr Tyr His Cys Asn Arg His His Trp Val Phe Gly Val Leu Leu Cys 

8 5 90 9b 

Asn Val Val Thr Val Ala Phe Tyr Ala Asn Met Tyr Ser Ser lie Leu 
100 105 110 

Thr Met Thr Cys lie Ser Val Glu Arg Phe Leu Gly Val Leu Tyr Pro 

115 123 125 

Leu Ser Ser Lys Arg Trp Arg Arg Arg Arg Tyr Ala Val Ala Aid Cys 
13 0 13 5 14 0 
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Ala Gly Thr Trp Leu Leu Leu Leu Thr Ala Leu Ser Pro Leu Ala Arg 
14 5 150 155 160 

Thr Asp Leu Thr Tyr Pro Val His Ala Leu GLy He He Thr Cys Phe 
165 170 175 

Asp Val Leu Lvs Trp Thr Met Leu Pro Ser Val Ala Met Trp Ala Val 

1B0 18 5 190 

Phe Leu Phe Thr He Phe He Leu Leu Phe Leu He Pro Phe Val He 
195 200 205 

Thr Val Ala Cys Tyr Thr Ala Thr He Leu Lys Leu Leu Arg Thr Glu 

210 215 220 

Glu Ala His Gly Arg Glu Gin Arg Arg Arg Ala Val GJy Leu Ala Ala 
22 5 2 30 2 35 2 4 0 

Val Val Leu Leu Ala Phe Val Thr Cys Phe Ala Pro Asn Asn Phe Val 
2 4 5 2 5 0 2 5 5 

Leu Leu A La JUs lie Val Ser Arg Leu Phe Tyr Gly Lys Ser Tyr Tyr 
260 265 270 

Mis Val Tyr Lvs Leu Thr Leu Cys Leu Ser Cys Leu Asn Asn Cys Leu 

27 5 28 0 2 35 

Asp Pro Phe Val Tyr Tyr Phe Ala Ser Arg Glu Phe Gin Leu Arg Leu 
2 90 2 95 30 0 

Arg Glu Tyr Leu Gly Cys Arg Arg Val Pro Arg Asp Thr Leu Asp Thr 
U)5 ' 310 315 32 0 

Arg Arg Glu Ser Leu Phe Ser Ala Arg Thr Thr Ser Val Arg Ser Glu 

32 5 3 30 335 

Ala Glv Ala His Pro Glu Gly Met Glu Gly Ala Thr Arg Pro Gly Leu 
3 4 0 34 5 350 

Gin Arg Gin Glu Ser Val Phe 
35 5 

■:2L0.- 77 

2 H : ■ 1 197 

2L2> DMA 

2 13: H . Sapiens 

■ : 4 0 0 > 7 7 

atggagt.cgg ggctgctgcg gccggcgccg gtgagcgagg tcatcgtcct gcattacaac 60 

"acaccgg la agct.ccgcgg tgcgcgctac cagccgqqtg ccgqcctgcg cqccgacgcc 120 

gtggtgrgro tggcggtgtg cgcottcatc gtgctagaga atctagccgt gttgttggtg 180 

ctcggacgec acccgcgctt ccacqctccc .atgttcctgc tcctgggcag cctcacgttq 24 0 

toggatctgc tqgcaggcgc cqcctacgcc gocaacatcc tactgtcggg grcgctcacg 300 
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ctgaaactgt cccccgcgct ctggttcgca cgggagggag gagtcttcgt ggcactcact 360 

gcgtccgtgc tgagcctcct ggccatcgcg ctgqagc :ca qxtcaccat ggrrgcgcag:: 2 j 

gggcccgcgc c:gt:rccag tcgggggcgc acgctggrga tggcagccg: ggcct gggg ~ 4 HO 

]tgt.cgctgc t :ctcgggct cctgccagcg ctgggctggs aitgcctggg tcgcctggac 540 

gcttgctcca ctgtcttgcc gctctacgcc aaggcctacg tgctcttctg cgtgctcgc: 600 

ttcgtgggca tcctggccgc tatctgtgca ctctacgcgc gcatctactg c:aggtacgc 660 

gccaa :gcgc ggcgccigcc gg:acggccc gggactgcgg ggaccacctc gaeocgggcg ' ? 20 

■jgtcgcaagc cgcgcrcgct ggccttgctg cgcacgctca g-rgtggtgct cctggccttt 780 

ftggcatgtt ggggccccct cttcctgctg etgttgctcg acgtggcgtg c>:cggcgcgc R40 

acctgtcctg tactcctgca ggocgatccc ttcctgggac tggcoatggc caactcact: 9 (.1 0 

ctgaacccca tcatctacac gctcaccaac cgcgacctgc g:cacgcgct cctgcgcctg 960 

gtctgctgcg gicgccactc ctgcggcaga gacccgagtg go t cccagca gtcggcgagc 1020 

gcggctgagg ctt ccggggg cctgcgccgc tgcctgcccc cgggccttga tgggagcttc 1080 

jgcggctcgg agcgctcatc gccccagcgc gacgggctgg acaccagcgg ciiccacaggc 1140 

agccccggtg cacccacagc cgcccggact ctggtatcag aaccggctgo agactga 1197 

-:210 * 78 
•:21l> 398 
•:212 ■ PRT 
•213:- H.Sap L ens 

■A00-- 78 

Met. Glu Ser <51y Leu Leu Arg Pro Ala Pro Vai Ser Glu Val I^e Val 

1 5 10 15 

Leu His Tyr Asn Tyr Thr Gly Lys Leu Arg Gly Ala Arg Tyr Gin Pro 

2 0 2 5 3 0 

Gly Ala Gly Leu Arg Ala Asp Ala Val Val Cys Leu Ala Val Cys Ala 

35 40 45 

The lie Val Leu Glu Asn leu Ala Val Leu Leu Val Leu Gly Arg His 

50 55 60 

Pro Arg Phe His Ala Pro Met Phe Leu Leu Leu G y Ser Leu Thr Leu 

f-b 70 75 80 

L'er Asp Leu Leu Ala Gly Ala Ala Tyr Ala Ala Asn He Leu Leu Ser 

8 5 90 9 5 

Gly Pro Leu Thr Leu Lys Leu Ser Pro Ala Leu Trp Phe Ala Arg Glu 

ioo 105 ;:o 
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Gly Glv Val Phe Val Ala Leu 
115 

He Ala Leu Glu Arg Ser Leu 

136 135 

Val Ser Ser Arg Gly Arg Thr 

14 5 150 

v'dl Ser Leu Leu Leu Gly Leu 
165 

Gly Arq Leu Asp Ala Cys Ser 
180 



Thr Ala 
1 ;. 0 



ier Val Leu Ser Leu Leu Ala 

i;i5 



TKr Met 



A. la Arc; Am Gly Pro Ala Pro 
14 0 



Leu Ala Met Ala Ala A- a Ala Trp Gly 
155 160 



Leu Pro 



Thr Val 
185 



Ala Leu Gly Trp Asn Gys Leu 

: 7 0 17 5 

Leu Pro Leu Tyr Ala Lys Ala 
190 



Tyr Val Leu Phe Cys Val Leu 
195 

Cys Ala Leu Tyr Ala Arg He 

Old 215 

Arg Leu Pro Ala Arg Pro Gly 

2 2 5 2 3 0 

Arg Arg Lys Pro Arg Ser Leu 



Ala Phe 
200 



Tyr Cys 



Thr Ale 



Val Gly He Leu Ala Ala He 
2(3 5 

Gin Val Arg Ala Asn Ala Arg 

2 20 

Gly Thr Thr Ser Thr Arg Ala 

235 240 

Leu Arg Thr Leu Ser Val Val 

250 2 55 



Leu Leu Ala Phe Val Ala Cys 
260 



Trp G . y 
2 6 5 



Pro Leu Phe Leu Leu Leu Leu 
27 0 



jeu Asp Val Ala Cys Pro Ala 
2 7 5 



Ara Thr 

2H0 



Cys Pro Val Leu Leu Gin Ala 



Asp> Pro Phe Leu Gly Leu Ala 

2 90 2 95 



Met Ala 



Asn Ser Leu Leu Asn Pro He 

30 0 



He Tyr Thr Leu Thr Asn Arq 

305 310 

Val Cys Cys Glv Arq His Ser 

3 2 5 



Asp Leu Arg His Ala Leu Leu Arg Leu 

"315 32 0 

Cys Glv Arg Asp Pro Ser Gly Ser Gin 

330 ' 335 



Gin Ser Ala Ser Ala Ala GH 
34 0 

Pro Pro Gly Leu A Bp Gly Ser 

355 

Gin Arg Asp Gly Leu Asp Thr 

370 375 

Pro Thr Ala Ala Arg Thr Leu 

3 8 5 3 90 



Ala Ser Gly Gly Leu Arg Arg Cys Leu 
345 350 

Phe Ser Gly Ser Glu Arg Ser Ser Pro 
360 365 

3er Gly Ser Thr Gly Ser Pro Gly Ala 
380 



Val Ser 



lu Pro Ala Ala Asp 

3 95 



::210 
::211 
■ 212 
<2 1 3 



79 

104 1 
DNA 

H . Sapiens 
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<4 00:> 7 9 

atqUcaacg ggtogtgctg ccgcatcgag ggggacacca tctcccaggt gatgccgccg 



ctgctcattg tggcctttgt 

tgct'jxaca tgaagacctg 

gatticctcc ttatgatctg 

tgggcttttg gggacattcc 

gggaqcatcg tgttccttac 

■ :arcacgcag tgaacactat 



gctgggcgca ccaggcaatg gggtcgccct gtgtggttt2 120 

gaagcccag: a:tgtttacc ttttcaattt ggccgtggct 180 

cctgcctrtt cggacagact attacctcag acgtagacac 2*4 0 

ctgccgagtg gggctcttca cgttggccat gaacagggcc 3U0 

ggtggtggct gcggacaggt atttcaaagt ggtccacccc 3*j0 

ctccacccgg gtggcggctg gcatcgtctg caccctgtgg 420 

gccc:.ggtca t^ctgggaac agtgtatctt ttgctggaga accatctctg cgtgcaagag 4H0 

,i.:ggccgtct c:tgtgagag cfccatcatg gagtcggcca atggctggca tgacatcatg 5-iO 

ttccjgctgg agttctttat gcccctcggc atcatcttat tttgctcctt caagattgtt 600 

7 gagcct ga ggcggaggca gcagctggcc agacaggcto ggatgaagaa ggcgacccgg 660 

r.tcarcatgg tggtggcaat tgtgttcatc acatgctacc tgcccagcgt gtctgetaga 720 

otct at ttcc tctggacggt gccctcgagt gcctgcgatc cctctgtcca tggggccctg 780 

c 'icat aaccc tcagcttcac ctacatgaac ageatgetgg atcccctggt gtattatttt SAO 

toaagcccct cctttcccaa attcLacaac aagctcaaaa tetgeagtet gaaacccaag 900 

cagc.'aggac aotcaaaaac acaaaggecg gaayagatgc caatttcgaa cctcggtcgc 960 

aggaiittgca tcagtgtggc aaatagtttc caaagecagt ctgatgggca atgggatccc 1020 

c-i ja'.tgttg agtggcactg a 1041 

• : 2 1 0 : ■ 8 0 

■:211 • 346 

•:2 12 • PRT 

•:213 • H. Sap Lens 

•:-;00 • 8 0 

Met Tyr Asn Gly Ser Cys Cys Arg lie Glu Gly Asp Thr lie Ser Gin 
15 10 15 

Val Mot Pro Pro Leu Leu lie Val Ala Phe Va 1 Leu Gly Ala Leu Gly 

2 0 2 5 3 0 

Asn Gly Val Ala Leu Cys Gly Phe Cys Phe His Met Lys Thr Trp Lys 
35 AO 4b 

Pro Ser Thr Val Tyr Leu Phe Asn Leu Ala Val Ala Asp Phe Leu Leu 

5 0 5 5 60 

Met lie Cys Leu Pro Phe Arg Thr Asp Tyr Tyr Leu Arg Arg Arg His 
65 70 75 80 
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Trp Ala Phe Gly Asp lie Pro Cys Arg Val Gly Lhu Phe Thr Leu Ala 
8 5 90 95 

net Asn Arg Ala G.y Ser lie Val Phe Leu Thr Val Val Ala Ala Asp 

10 0 10 5 110 

Arq Tyr Phe Lvs Val Val His Pro His His Ala V,il Asn Thr He Ser 
115 ' 120 125 

Thr Arg Val 7iia Ala Glv He Val Cys Thr Leu Trp Ala Leu Val .le 

130 ' 135 140 

Leu Gly Thr u al Tyr Leu Leu Leu Glu Asn His Lou Cys Val Gin Glu 

-.45 HH 155 160 



Thr Ala Val Cvs Glu Ser Phe He Met Glu Sor 

10 5 1^0 

His Asp lie net Phe Gin Leu Glu Phe Phe Met Pro 
130 185 

Leu Phe Cys :';er Phe Ly^ Tie Val Trp Ser Leu Arg 

19 5 2 0 0 

Leu Ala Arg Gin Ala Arc Met Lvs Lys Ala Thr Arg 

210 215 220 



Glv Trp 
17 5 



Leu Gly 
190 

Arg Arg 

205 



Gin Gin 



Phe He Met. Va! 



Val Ala lie v.il Phe 1H "hr Cys Tyr Leu Pro Ser Val Ser 



22 5 

>>u Tyr Phe Leu Trp* 
2-15 



235 

Thr Val Pro Ser Ser Ala Cys 
250 



Ala Ara 

2 A 0 



Asp Pro Ser Val 



Glv Ala Leu H^s Ho Thr Leu Ser Phe Thr Tyr 



260 



2 65 



Met Asn 

270 



Leu Asp Pro Leu V,-i 1 Tyr Tyr Phe Ser Ser Pro Ser 



Phe Pro 
2-3 5 



Lyr Phe 



Asn Lys .^eu Lys He Cys Ser Leu Lys Pro Lys 



Gin Pro Gly His 



190 



295 



300 



S--r Lys Thr Gin Arg Pro Glu Glu Met Pre He Ser A?n 
li5 31 0 315 

Arg Ser Cvs lie Ser Val Ala Asn Ser Phe Gin Ser Gl.n Ser 
325 330 



Gly Ar<i 

32 0 



Asp Gl^ 
335 



Asp Pro His He Val Glu Trp His 
340 345 



!10- 81 



11 

■HI 2 
■'213 



2 5 2 5 
DNA 

H . S a p i ens 



^4 00:- 81 

_:aagaaLgac aqqtqactto .:caaqtatgc ctggccacaa tac 

Page 5 4 
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gcgatcctat agtgacaccc ca;ttaatca gccictactt catagtgctt attqgcgggc 120 

tggtgggtgt catttccatt cttt tcctcc tggtgaaaat gaacacccgg tcagtgacca 180 

ccatggcggt cattaacttg gtggtggtcc acagcgtttt tctgctgaca gtgccatttc 2A0 

gcttgaccta cctcatcaag aagacttgga tgtttgggct gcccttcrgc aaatttgtga 30 0 

gtgccatqct gcacatccac atgtacctca cgtt^ctatt ctatgtggtg atcciggtca 3-jO 

ccagatacct catcttcttc aagtgcaaag acaaagtgga attctacaga aaactg:atg -1."!0 

ctgtggctgc cagtgctggc atqtggacgc tggtgattgt cattgtggta cccctggttg 4H0 

tctcccggta tggaatccat gaggaataca atgaggagca ctgttttaaa tttcacaaag 540 

agcttgctta cacatatgtg aaaatcatca actatatgat agtcattttt gtcatagccg t'iOO 

tt jctgtgat tctgttggtc ttccagqtct tcatcattat gttgatggtg cagaagctac »3*i0 

gc-;actcttt actatcccac caggagttct gggctcagct gaaaaaccta ttttttatag ~7.:0 

gggtcatcct tgtttgtttc cttccrLacc agttctttag gatctattac rtgaatgttg *? -"i 

tgacgcattc caatgcctgt aacag:aagg ttgcatttta taacgaaaLc ttcttgagtg h -1 1 1 

taacagcaat tagctqctat gatrrgcttc tctttgtctt tgggggaagc cattggttta ^on 

agoaaaagat aattggctta tggaattgtg ttttgtgccg ttagccacaa actacagtat 

tcatatttgc ttcctttata ttgggaataa aaatgggtat aggggaggta agaatggtat 10! i"j 

ttoattactt gatcaaaacc atgccttqat gtacccaaaa caaaaggact ataaaatgca IC^'mi 

agagccctca ttgtagtcct tatgggatcc ctcccatctc tgagtgatgg ccgtacaaag 1 .1 4 n 

accagtgttg ttgaatccac ctggagttgc aatattacat tattttccag tacagaatgt lT'i.-i' 

ctgtgtggcc catgaaagca acataggttt taagagtttt agagtitcat tagctcattc lU»:-u 

taagttcctc tgtttgaagc atggtctctt aggttttgga ctgaactcag acctttagtt 1220 

cttttcatcc cactrcacct taggtaagta aattctggcc accacccagc tccaaagaca 

ca-iactctcc UcgcLaacc aggttagatg tcccattcat ctcatgccct gataaaaac: 1 -I A i ) 

ga^aaqggga gagaatagtt aaaaattttt ctag<]gtaLc ataactctgg taggaagt.ca 1!': : 0 

:c.gtc:aga aaLcaagaga aaaagaacgt gtggcctcct gttataacaa gggtttct.ag l L >--i< 

at *: tgtcctg tgaaaggtcg tttaaggact tggggatcaa cttccLcaat. tatcaccaat l^:.n 

tgcactgttg ctccaaaaat catttaaaag cttactggac atatctacat aatgqtgaaa lc'-bf.' 1 

ctgtaattta gagactatcc cigactaatg tgctggtagg cattaaaatg aqttcccaag 1 4 0 

ggaagtgatt aaaatttttt t ctrt. t c tgt tttttgagag aatttctaga tgtcctgggc 
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cacagttaat 
1 1 ccgccgt t 
a t'. g cayad t g 
gtca-acccat 
tgqacaggcc 
t c ccacttct 
gaatga tggt 
aagaqgact t 
g c a t g a a g a a 
aaaatcaraa 
gg a tgaaa t a 
oacaiqtacc 
tcttt 



taagat tttt 
aaaattatat 
tgcaggtttg 
ctacattagg 
ccatt gt gtg 
aaqtgagaac 
ttccaggtta 
ttgagaagta 
gqt 1 1 1 ctga 
z cct ttaaaa 
atictgtacaa 
cct: gaa c tt a 



a ggggggaca 
atatatatat 
t laca taggt 
tatttctcct 
atgttcccct 
at gcggt g 1 1 
aaatta t a t a 
t at agaaaaa 
atatttgaag 
t at aggaaaa 
caaactccca 
aaataaaatt 



gaaagttata 
ttaaattata 
at acacgtgc 
aat get ctcc 
ccct gtgt cc 
tggt 1 1 tct g 
tt 1 1 taaat a 
ccat t aatt t 
agt g ga taaa 
at aact aat g 
tgacacatgt 
t a a a gtataa 



ct gaa at ct t 
ccttaagttc 
catggtqgtt 
ctcccctag z 
a t gtgt 1 1 1 z 
ttcctgtgt: 
aatgaaaac: 
agactctgt g 
t a aat gt cc z 
ggaact agg z 
1 1 acc t atg t 
t a a t a a a a t a 



t a g a g c t c c c 
t ggggtaca t 
tgcgqcacct 

attgttcaac 
agtttgctga 
gtgtttttaa 
agattaggtt 
ccaaagcaat 
ttaatactcg 
aai;:aaacctg 
atatggattt 



18 G 
192 

1 98 
2<K 

2 1 C: 
21ii 

22 H 

2 3<; 

■Ad 
4 ti 

52b 



■:210 ■ 82 

•211 • 312 

■:2\2 ■ PRT 

■ : 2 1 3 ■ H . Sapiens 

•:4 00 ■ 8 2 

net 7hr Gl y Asp Phe 
1 5 

Ser :1er Cys Asp Pro 
2 0 

lie Val Leu Tie Gly 
3 5 



Leu Val Lys Met Asn 
5 0 

Leu \ r al Val Val His 

G 5 

Thr Tyr Leu lie Lys 

8 5 

Phe Val Ser Ala Met 
1 00 

Tyr Val Val lie Leu 
115 



Pro Ser Met 



le Val Thr 



C 1 y Leu Val 
40 



Thr Arg Ser 
55 

Ser Val Phe 
70 

Lys Thr Trp 

Leu His lie 



Val Thr Arg 

120 



Pro Gly His 
10 

Pro His Leu 
25 

Gly Val lie 



Val Thr Thr 



Asn Thr Ser Arg Asn 
15 

lie Ser Leu Tyr Phe 

30 

Ser lie Leu Phe Leu 
4 5 

Met Ala Val lie Asn 
60 



Asp Lys Val Gin Phe Tyr Arg Lys 



Leu Leu Thr 

75 

Met Phe Gly 

9 0 

His Met Tyr 
105 

Tyr Leu lie 



Leu His Ala 
Page 



Val Pro Phe Arg Leu 
8 0 

Leu Pro Phe Cys Lys 
9 5 

Leu Thr Phe Leu Phe 
110 

Phe Phe Lys Cys Lys 
125 

Val Ala Ala Ser Ala 

5 6 
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130 135 14 0 

Cly Met Tro Thr Leu Val lie Val He Val Val Pro Leu Val Val Ser 
1-15 * 150 155 160 

Arg Tyr Gly He His Glu Glu Tyr Asn Glu Glu His Cys Phe Lys Phe 
165 170 175 

His Lys Glu Leu Ala Tyr Thr Tyr Val Lys He lie Asn Tyr Met lie 
180 185 190 

Val He Phe Val He Ala Val Ala Val He Leu Leu Val Phe Gin Val 

L95 200 205 

Phe He He Met Leu Met Val Gin Lys Leu Arg His Ser Leu Leu Ser 
210 215 2 2 0 

Kis Gin Glu Phe Trp Ala Gin Leu Lys Asn Leu Phe Phe He Gly Val 
22 5 2 30 2 35 24 0 

lie Leu Val Cys Phe Leu Pro Tyr Gin Phe Phe Arg He Tyr Tyr Leu 
2 45 250 255 

Asn Val Val Thr His Ser Asn Ala Cys Asn Ser Lvs Val Ala Phe Tyr 

260 265 270 

Asn Glu He Phe Leu Ser Val Thr Ala lie Ser Cys Tyr Asp Leu Leu 
275 280 285 

Leu Phe Val Phe Gly Gly Ser His Trp Phe Lys Gin Lys He He Gly 

2 90 2 95 300 

Lou Trp Asn Cys Val Leu Cys Arg 
30 5 * 310 

-:210 • 83 

-211 • 1125 

-:212 * DNA 

■ :2 1 3 • H . Sapiens 

■MOO • 83 



gcagga gca c 


tgaaaatcag 


gaacaa t cct 


gtatt ttttg 


t gat aatcaa 


caaggacaaa 


t"0 


dCt:otccat 


a tgtaaat aa 


cagegt tatg 


agcagcaatt 


catccctgct 


ggtggctgtg 


120 


Paget', gt get 


aegegaacg t 


gaa t gggtoc 


tgtgtgaaaa 


t cccct tctc 


gccgggatcc 


ifo 


.;qgq"gattc 


tgtacatagt 


gt ttggcttt 


ggggctgtgc 


t g g c t g t g 1 1 


tggaaacct c 


240 


ctggt gatga 


1 1 1 c a a t c c t 


ccatttca^g 


cagctgcact 


ctccgaccaa 


1 1 1 tct cgtr. 


300 


gectet ctgg 


cct gcgctga 


t t tcttggtg 


ggtgt gactg 


t gat gecett 


cagca t ggt c 


3^0 


aggacggtgg 


agagctgetg 


gt attttggg 


aggagttttt 




cacctgctgt 


4 20 


gatgt ggcat 


tttgt t actc 




cacttgtgct 


tcatctccat 


cgacaggtac 


4 yo 


-itt gcggtta 




ggtctatcct 


accaagttca 


cegtatet gt 


gtcaggaatt 


540 








Page 


57 
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tgcatcagcg tgtcctggat cctgcccctc atgtacagcg gtgetgtgtt ctacacaggt 600 

gtctatgacg atgggctgga ggaattatct gatgccctaa actgtatagg ajgttgtcag 660 

accg'itgtaa atcaaaactg ggtgttgaca gatttictat rcttctttat a ;ct acct t z 7 2 0 

attat:gaua Ltctg^atgg taacaLattt cttgtggcta gacgacaggc gaaaaagata 



agagagagaa aagcagciaa aaccctgggg gtcacagtgg tagcatttat gatttcatgg 



li 0 



gaaaatactg gtagcaagac agaatcatcc tcagagagtt acaaagccag agtggccagg 840 



900 



r.accatata gcatrgattc attaattgat gcctttatgg gctttataac c:ctgcctgt 960 

atrtatgaga rttgctgttg gtgtgcttat tataactcag xatgaatcc t i^gatttat 1020 

g<;tttatttt acccaiggtt taggaaagca ataaaagtta Ltgtaactgg tcaggtttta 3080 

aagaacagtt. cagcaaccat gaatttgt tt tctgaacata t.ataa 1125 

■:; ] 0> 8 4 
345 

•:.:i2: PRT 

•:: 1 3: • H . Sapiens 

■:-:00: 84 

M,:t !-er Ser As n Ser Ser Leu Leu Val Ala Val Gin Leu Cys Tyr Ala 

10 15 

A.--n ''al Asn GLy Ser Gvs Val Lys lie Pro ?he Ser Pro Gly Ser Arg 
2'f ' 25 30 

Vol Tie Leu Tyr lie Val Phe G ;. y Phe Gly Ala Val Leu Ala V„ 1 Phe 
3:> 4 0 4 5 

G^y i'.sn Leu Leu Yal Met Tie Ser lie Leu His Phe Lys Gin Leu His 
1 0 55 60 

:>r Pro Thr Asn Phe Leu Val AJ a Ser Leu Ala Cys Ala Asp Phe Leu 

7 0 7 5 8 0 

Vcl ("ly Val Thr Val Met Pro Phe Ser Met Val Arg Thr Val Glu Ser 
5 5 90 9; 

G'.'s Trp Tyr Phe Gly Arq Sor Phe Cys Thr Phe His Thr Cys Cys Asp 
100 " 10 5 110 

7...1 r.la Phe Cys Tyr Ser Ser Leu Phe His Leu Cys Phe lie S*r Tie 
115 1 0 12 5 

Asp Arg Tyr lie Ala Val Thr Asp Pro Leu Val Tyr Pro Thr Lys Phe 
I 30 13 5 14 0 

Tar "al Ser Val Sor Gly lie 2ys lie Ser Val Ser Trp Tie Leu Pre. 
14 5 15 0 15 5 16 0 

Lou Met Tyr Ser Gly Ala Val Phe Tyr- Thr Gly Val Tyr Asp Asp Gly 
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165 170 175 

Leu Glu Glu Leu Ser Asp Ala Leu Asn Cys lie Gly Gly Cys Gin Thr 
180 18 5 ^ 190 

Val Val Asn G^n Asn Trp Val Leu Thr Asp Phe Leu Ser ?he Phe lie 
195 200 205 

Pro Thr Phe lie Met lie lie Leu Tyr Gly Asn lie Phe Leu Val Ala 

2 10 215 22 0 

Arg Arg Gin A._a Lys Lys Tie Glu Asn Thr Gly Ser Lys Thr Glu Ser 
225 230 235 240 

Ser Ser Glu Ser Tyr Lys Ala Arg Val Ala Arg Arg Glu Arg Lys Ala 
245 25 0 255 

Ala Lys Thr Leu Gly Val Thr Val Val Ala Phe Met lie Ser Trp Leu 
260 2 65 270 

Pro Tyr Ser lie Asp Ser Leu lie Asp Ala Phe Met Gly Phe lie Thr 
275 280 285 

Pro Ala Cys lie Tyr Glu lie Cys Cys Trp Cys Ala Tyr Tyr Asn Ser 
90 295 300 

Ala Met Asn Pro Leu lie Tyr Ala Leu Phe Tyr Pro Trp Phe Arg Lys 
305 310 315 320 

Ala Tie Lys Val lie Val Thr Gly Gin Val Leu Lys Asn Ser Ser Ala 

325 330 335 

Thr Met Asn Leu Phe Ser Glu His lie 

.M0 34 5 

•:l!1D:- 85 

• _ 1 i l'i20 

•:212:- DMA 

■ :.: 1 3: ■ H . Sapiens 

•M00:- 85 

accargaarg agccactaga ctatttagca aatgcttctg atttccccga ttatgcagct 60 

got zt t gg.ja attgcactga tgaaaacatc ccactcaaga tqcactacct ccctgttatt 120 

i.dtg7catta tcttcetcgt gggatttcca ggcaatgcag ragtgatatc cacttacatt 180 

t.i.ca.iaat'ia gaecttggaa gaqcagcacc atcattatgc tgaacctggc ctgcacaqat 2-10 

cf.qc Qta f tcja-:cagcct cc:oc 1 1 cct g alt cactact atgcoagtgg cgaaaactgg 300 

a'.ct* tgg.jg atttcatgtg taagtttatc cgctLcagct tccatttcaa cctgtatagc 360 

agca' octot tcctcacctg tttcagcatc ttccgctact gtgtgatcat tcacccaatg 420 

ngctgctttt ccattcacaa aactcgatgt gcagttgtag cctg-gctgt ggtgtggatc 480 

attt jactgg tagctgtca t; tccgatgacc ttcttgatca catcaaccaa caggaccaac 540 
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agatcagcct 


gtctcgacct 


eacc 


agt t eg 


gatgaactca 


atact at t aa 


gtqgtacaac 


600 


otgattttga 


ctgcaagtac 


1 1 to 


tgc'.'tc 


ccct tggtga 


tagt gacact 


tigctatacc 


660 


acgatUtcc 


acact:tgac 


ccat 


gga.-tg 


caaactgaca 


getgecttaa 


gcagaaagca 


7 20 


ogaaggctaa 


ccattctgct 


actc 


ettgea 


ttttaegtat 


gttttttacc 


ct t ccat a t c 


780 


ttqagggtca 


ttcaggatcg 


aate 


t caecc 


tgctttcaat 


cagttgttcc 


attgagaatc 


R4 0 


agatccatga 


agcttacatc 


gttt 


eta ct a c 


cattatgetg 


ctctgaacac 


ctttggtaac 


90 0 


ct gttactat 


atgtggtggt 


cage 


gacaac 


tttcagcagg 


ctgtctgctc. 


aacag'.gaga 


9 bO 


igcaaagtaa 


gcgggaacct 


r gag 


caagca 


aagaaaatta 


gttactcaaa 


caacccttga 


1020 



::!10;- 8 6 

2 1 1- 3 3 6 

:212> PRT 

:213> H. Sapiens 



M^t Asn Glu Pro Leu Asp Tyr Leu Ala Asn Ala Ser Asp Phe Pro Asp 
1 5 10 15 

Tyr Ala Ala A L a Phe Gly A;sn Cvs Thr Asp Glu Asn lie Pro Leu Lys 

20 25 30 

Met His Tyr Leu D ro Val lie Tyr Gly Tie lie Phe Leu Va 1 Gly Phe 
35 4i' 45 

Pro Gly Asn Ala Val Val lie S*-r Thr Tyr He Phe Lys Met Arg Pro 
50 55 60 

Trp Lys Ser Ser Thr He ILe M*.-t Leu A,;n Leu Ala Cys Thr Asp Leu 
6 5 



7 0 7 5 $ 0 



Leu Tvr Leu Thr Ser Leu Pro Phe Leu lie His Tyr Tyr Ala Ser Gly 

3 5 90 95 

Glu Ar-n Trp He Phe Gly Asc Phe Met Cvs Lys Phe Tie Arg Phe Ser 

130 105 110 

Phe His Phe Asn Leu Tyr Ser S>;r He Leu Phe Leu Thr Cys Phe Ser 

115 120 125 

ILe Phe Arg Tyr Cys Val He lie His Pro Met Ser Cys Phe Ser He 

110 13 5 14 0 

His Lys Thr Arg Cys Ala Val V.H Ala Cys Ala Val Val Trp He lie 

145 150 155 160 

Ser Leu Val Ala Val lie Pro Met Thr Phe Leu He Thr Ser Thr Asn 

165 170 175 

Arg Tnr Asn Arg Ser Ala Cys Leu Asp I.eu Thr Ser Ser Asp Glu Leu 

180 185 190 
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Asn Thr Tie Lys Trp Tyr Asn Leu lie Leu Thr Ala Ser Thr Phe Cys 
1S5 200 205 

Leu Pro Leu Val lie Val Thr Leu Cys Tyr Thr Thr lie lie His Thr 
210 215 2 2 0 

Leu Thr His Gly Leu Gin Thr Asp Ser Cys Leu Lys Gin Lys Ala Arg 
225 230 235 240 

Arg Leu Thr lie Leu Leu Leu Leu Ala Phe Tyr Val Cys Phe Leu Pro 
245 250 255 

Phe His lie Leu Arg Val Tie Gin Asp Arg He Ser Ala Cys Phe Gin 
260 265 270 

!ier Val Val Pro Leu Arg He Arg Ser Met Lys Leu Thr Ser Phe Leu 

275 280 285 

Asp His Tyr Ala Ala Leu Asn Thr Phe Gly Asn Leu Leu Leu Tyr Val 
290 295 300 

Val Val Ser Asp Asn Phe "In Gin Ala Val Cys Ser Thr Val Arg Cys 
305 310 315 320 

Lys Val Ser Gly Asn Leu Glu Gin Ala Lys Lys He Ser Tyr Ser Asn 
325 330 335 

■:210:- 8 7 

:l:ll> 1138 

: 2 1 2 : - DN A 
-:213> H. Sapiens 

-i 0 0 • 8 7 



gtactgaact 


at t gaatgga 


act t ggaaat 


aaagt ccctt 


ccaaaa taac 


■ r .o 


cagagagt aa 


t aggt aaatg 


1 1 1 1 agaagt 


gagaggact c 


aaattgccaa 


i; o 


ttttattttt 


cel. cctaggt 


t tct gggata 


agt a tgtgca 


aataaaaaat 


180 


aggaactgta 


acctgattat 


gga 1 1 tggga 


aaaagataaa 


tcaacacaca 


24 0 


taaactgatt 


gacagccctc 


aggaatgatg 


cccttttgcc 


acaatataat 


3(0 


tgtgtgaaaa 


acaactggtc 


aaatqatgtc 


cgtgct t ccc 


tgtacagttt 


3f-0 


ataattctga 


ccacactcgt 


t ggcaat.ctg 


atagttattg 


tttctatatc: 


4 2 0 


caacttcata 


ccccaacaaa 


1 1 ggctcat t 


cat tccatgg 


c c a c t g t g g a 


4 h 0 


gggt gt ctgg 


teatgectta 


cagt atggt g 


agatctget g 


aqcactgttg 


5<;c 


gaagtct tct 


g t a a a a 1 1 c a 


cacaagcacc 


gacatta tgc 


t gage t cage 


6C-0 


cat t tgtctt 


tcatctccat 


t gaccgctac 


tatget gtgt 


gtgatccact 


6 00 


gccaagat ga 


atatctt.qgt 


tatttgtgtg 


atgatctt ca 


1 1 agttggag 


720 


gt 1 1 1 tgcat 


ttggaatgat 


ct t 1 c c gga g 


ctaaact tea 


aacgeget ga 


780 
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agagaLatat ta;aaacatg ttcactgcag aggaggttgc tctgtcttct ttagcaaaat 840 

atctggggta ct jaccttta tgacttcttt ttatatacct ggatctatta tgttatgtgt 900 

ctattacaga atatatctta tcgctaaaga acaggcaaga "taa:tag"g atgccaatca 960 

gaagctccaa attggattgg aaatgaaaaa tggaatttca :aaagcaaag aaaggaaagc 1020 

tgtgaaqaca ttggggattg tgatgggagt tttcctaa:a tgctggtgcc ctttctttat 1080 

otgt.ac.agtc atggaccctt ttcttcacta cattattcca :cta-:tttga atgatgta 1138 

-:210> 88 
296 

:212> PRT 
•:2 1 3: • H . Sapi ens 

:*100.- 88 

Met Met Pro Phe Cys His Asn lie lie Asn lie Ser Cys Val Lys Asn 
L 5 10 15 

Asn Trp Ser Asn Asp Val Arg Ala Ser Leu Tyr Ser Leu Met Val Leu 

2 0 2 5 3 0 

He lie Leu Thr Thr Leu Val Gly Asn Leu He Val lie Val Ser He 
3 5 4 0 15 

Ser His Phe L/s Gin Leu His Thr Pro Thr Asn Trp Leu He His Sc:r 
■3 0 3 5 60 

Met Ala Thr Val Asp Phe Leu Leu Gly Cys Leu Val Met Pro Tyr Ser 

6 5 7 0 7 5 8 0 

Met Val Arg Ser Ala Glu His Cys Trp Tyr Phe Gly Glu Val Phe Cys 
8 5 9 0 9b 

Lvs lie His Thr Ser Thr Asd lie Met Leu Ser Ser Ala Ser He Phe 
100 " 105 110 

H ls Leu Ser Phe lie Ser Tie Asp Arg Tyr Tyr Ala Val Cys Asp Pro 
115 120 125 

Leu Arc: Tyr Lys Ala Lys Met Asn He Leu Val Tie Cys Val Met He 
130 135 140 

Phe lie Ser Trio Ser Val Pro Ala Val Phe Ala Phe Gly Met lie Phe 
H5 ^ 150 155 160 

Leu Glu Leu Asn Phe Lys Gly Ala Glu Glu He Tyr Tyr Lys His Val 
165 170 175 

His 3ys Arg Gly Gly Cys Ser Val Phe Phe Ser Lys lie Ser Gly Val 
180 185 ■ 90 

Leu Thr Phe Met Thr Scr Phe Tvr He Pro Gly Ser lie Met Leu Cys 
195 2 00 20b 
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Val Tyr Tyr Arg lie Tyr Leu lie Ala Lys Glu Gin Ala Arg Leu lie 

210 215 220 

3er Asp Ala Asn Gin Lys Leu Gin lie Gly Leu Glu Met Lys Asn Gly 
225 230 235 240 

Ii.e Ser Gin Ser Lys Glu Arg Lys Ala Val Lys Tnr Leu Gly lie Val 
245 250 " 255 

Met Gly Val Phe Leu lie Cys Trp Cys Pro Phe Phe lie Cys Thr Val 

260 265 27 0 

Mhit Asp Pro Phe Leu His Tyr lie lie Pro Pro Thr Leu Asn Asp Ala 
2 7 5 280 285 

Arg Gly Ser Arg Ala Asn Ser Ala 

290 295 



:.':10 

: 2 1 2 

:.":13 



8 9 

1023 
DNA 

H . Sapiens 



«:-J00> 89 

gaaat gatgc ccttttgcca caatataatt aatatttcct gtgtgaaaaa caactggtca 60 

aatgatgtcc gtgcttccct gracagttra atggtgctca taattctgac cacactcgtt 12 1 

ggcaatctga tagttattgt tt:tatatca cacttcaaac aacttcatac cccaacaaat lf/i 

tggct cattc attccatggc ca:tgtggac tttcttctgg ggtgtctggt catgcrttac 2-!i 

agtatggtga gatctgctga gcactgttgg tattttggag aagtcttctg taaaattcac 2;(m 

acaagcaccg acattatgct gagct :agcc tccattttcc atttgtcttt cat.ct ;catt 

giiccgctact atgctgtgtg tgatccactg agatataaag ccaagatgaa tatcttqgtt 42'*'. 

at ttgtgtga tgatcttcat tagttqgagt gtccctgctg tttttgcatt tggaatgatc 

tttctggagc taaacttcaa aggcgctgaa gagatatatt acaaacatgt tcactgcaga ! 

gcsaggttgct ctgtcttctt tagcaaaata tctggggtac tgacctttat gacttctttt > 

tatatacctg gatctattat gttatgtgtc tattacagaa tatatcttat cgctaaagaa * 

caggcaagat taattagtga tgccaatcag aagctccaaa ttggattgga aatgaaaaa: 

ggaatttcac aaagcaaaga aaggaaagct gtgaagacat tggggattgt ga~qggagt*_ 

ttcctaatat gctggtgccc tttctttatc tgtacagtca tggacccttt tctcactac ! 

attattccac ctactttgaa tgatgnattg atttggtttg gctacttgaa ctctacattt 90 1 

aatccaatgg tttatgcatt tttctatcct tggtttagaa aagcactgaa galgatgctg 9 6 ■! 

tttggtaaaa ttttccaaaa agattcatcc aggtgtaaat tatttttgga attgagt.tca 1020 
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rag 



210 

211 
212 

2 13 



90 

339 

PRT 

H . Saoi ens 



vhOO:- 90 

Met Met Pro Phe Cys 



Asn Tro Ser Asn Asp 

2 0 

Tie lie Leu Thr Thr 
3 5 

:'(;r His Phe Lvs Gin 
5 0 

Met A J a Thr Val Asp 



r:ot Val Arg Ser Ala 

~ 5 

Lys He His Thr 

1 CiO 

H^s Leu Ser Phe I 1 e 
115 

Leu Ar g Tyr Lys A J a 
12 0 

Phe Lie Ser Trp Ser 
11 5 

Leu 01 u Leu A:s n Phe 

1 65 

ilis Cys Arg G.i y Sly 

Leu Thr Phe Met Thr 

1 ii5 

'Hi Tyr Tyr Arq He 
210 

;-?r Asp Ala Asn Gin 
22 5 

[le Ser Gin Ser lys 
24 5 

Met Gly Val Phe Leu 



His Asn He He 
Val Arg Ala Ser 



Leu Val Gly Asn 
4 0 

Leu His Thr Pro 
5 5 

Phe Leu Leu Gly 
7 0 

Glu His Cys Trp 



Thr Asp Tie Met 
105 

Ser He Asd Arg 
120 

Lys Met Asn He 

1 35 

Val Pro Ala Val 

150 

Lys Gly Ala Glu 



Cys Ser Val Phe 
185 

Ser Phe Tyr Tie 

200 

Tyr Leu He Ala 
215 

Lys Leu Gin He 

2 30 

Glu Arg Lys Ala 
He Cys Trp 2ys 



Asn He Ser Cvs Val Lvs Asn 
10 ^ 15 

Leu Tyr Ser Leu Met Val Leu 



Leu He Val lie w al Ser He 



Thr Asn Trp Leu He His Ser 
60 

Cvs Leu Val Met Pro Tvr Ser 

7 5 8 0 

Tyr Phe Gly Glu "\il Phe Cys 
90 9 5 

Leu Ser Ser Ala Ser He Phe 
. 10 

Tvr Tyr Ala Val Cys Asp Pro 
125 

Leu Val He Cys V,H Met He 
14 0 

Phe Ala Phe G y Met He Phe 
155 160 

Glu He Tyr Tyr Lvs His Val 
1^0 175 

Phe Ser Lys He Ser Gly Vai 
i 90 

Pro Gly Ser He Mot Leu Cys 
205 

Lys Glu Gin Ala Arg Leu lie 

2 20 

Gly Leu Glu Met L /s Asn Gly 
235 240 

Val Lys Thr Leu Hy He Val 

2 5 0 2 5 5 

Pro Phe Phe He Cys Thr Val 
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26C 265 270 

Met Asp Pro Phe Leu His Tyr lie lie Pro Pro Thr Leu Asn Asp Val 

27 5 280 28 5 

Leu lie Trp Phe Gly Tyr Leu Asn Ser Thr Phe Asn Pro Met Val Tyr 

290 295 300 

Ala Phe Phe Tyr Pro Trp Phe Arq Lys Ala Leu Lys Met Met Leu Phe 

305 310 315 32 0 

Gly Lys lie Phe Gin Lys Asp Ser Ser Arq Cys Lys Leu Phe Leu Glu 

325 330 335 

Leu Ser Ser 

210:- 91 
;!11> 1»S96 
212> DNA 
•-:213> H. Sapiens 

•M00:- 91 

otgtaaagta gattgiacga ggactccatg aggtcatcca cttcaagtcc ttggcatagg O" 

ataartactc aaaaggtgat gacaatggcg cagggaggga tgqtgacttg cctggagatg 12"'" 

cucaqcaccg tctcLcccat actegg'cat tcacaccatc attgattcac caggcaccac 1 1; v 

nocgt.gtcca gcaggactct ggggacccca aatggacact accatggaag ctgacctggg 2 4l 

tqccrictgqc cacaggcccc gcacagagct tgatgatgag gactoctacc cccaaggtgg 3 0'' 

ot:ggcjacacg gtcttcctgg tggccctgct gctccttggg ctgccagcca atgggttgat 3 

ggcgT ggctg gccggctccc aggcccggca tggagctggc acgcgtctgg cgctgctcct 4 2' 

gctcagcctg gccctctotg acttcttgtt cctggcagca gcggccttcc agatcctaga 

ga t coggcit gggggac.act ggccgctgqq gacagctgcc tgccgcitct actacttcc: 

atggqgcgtg tcctactccc ccggcctctt cctgctggcc gccctcagcc tcgaccgctg 

outgctggcg ctgtgcccac actggtac:c tgggcacegc ccagtccgcc tgccoctctg 

qgtc.gcgcc ggtgtctggg tgctggccac actcttcagc gtgccctggc tggtcttccc 

ogagqctgcc gtctggtggt acga:ctggt catctgcctg gacttctggg acagcgagga 

gotgrcgctg aggatgctgg aggtccr.ggg gggct^cctg :c:ttc:tcc tgctqctcgt 

c*;gc-:acgtg ctcacccagg cca:agcctg tcgcacctg;; ;accgccaac agcaqcccgo 

agcctgccgq ggcrtcgccc gtgtggccag gaccattctg tcagcctatg tggtcctgag 9*-2 

gctgccctac cagctggccc agctgctcta c;tggccttc ctgt.gggacg t:tactctgg 1020 

ctacctgctc tgggaggccc tggtctactc cgactacctg atcctactca acagctgcct 10R0 
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cagceccttc ctctgcctca tggccagtgc cga:ctccgg 
ctcgtccttc gcggcagctc tctgcgagga gcggccgggc 
acagacccag ct.agar.ictg agggtccaac tctgccagag 
acagdtggat cctgtggccc agcctcaggt gaaccccaca 
cacagctcag ccacag-::ga accctacggc ccagccaeag 
acagctgaac ctcatggccc agccacagtc agattctgtg 
ta.acgtccag acccctgcac ctgctgccag ttctgtgccc 
occd-acccoa tcctcg;atc ctaccccagg ggccct t gag 
ctctgaagga gaaagcccca gcagcacccc gccagaggcg 
gtgagggtcc aggaacacgc aggcccacca gag cagtgaa 
gaaccagcca gtcaqa 



aecctgctgc gctccgtgct 1140 

agcttcacgc ccactgagcc 1200 

cogatggcug aggcccagtc 1260 

ctccagccac gatcggatcc 1320 

tcggatccca cagcccagcc 1380 

gcccagccac aggcagacac 14 40 

agtccctgtg atgaagcttc 1500 

gacccagcca cacctcctgc 1560 

gccccgggcg caggccccac 162 0 

agagcccagg gcagacagag 168 0 

16 9 6 



-2 10- 92 

■:211- 505 

:2 12 ■ PRT 

-:2 13. H. Sapiens 



: 1 3 0 



92 



L»rj Ala Trp Arg Cys Thr Ala Pro Ser Leu Pro Tvr Ser 
L 5 10 

Thr : le lie Asp Ser Pro Gly Thr Thr Pro Cys Pro Ala 



2 0 



2 5 



Uy Pro Gin Met Asp Thr Thr Met Glu Ala Asp 



Leu Gly 
45 



Val lie Hi s 
15 

Gly Leu Trp 

30 

Ala Thr Glv 



His Ar-j Pro Arg Thr Glu Leu Asp Asp Glu Asp 



Ser Tyr 
60 



Pro Gin Glv 



Gly Trp Asp Thr Val Phe Leu Val Ala Leu Leu Leu Leu 
o5 10 75 

A-a Asn Gly Leu Met Ala Trp Leu Ala Gly Ser Gin Ala 



Gly Leu Pro 
80 

Arg His Glv 

9b 



Ala Gly Thr Arg Lea Ala Leu Leu Leu Leu Ser Lea Ala 
10 0 10 5 

?:ie Le.i ?hc Leu Ala Ala Ala Ala Phe 3 In lie 



1 1 5 



120 



Llv His Trp Pro Leu Gly Thr Ala Ala Cys 
135 

Tyr Ser Ser Gly Leu Phe 
150 



130 

Leu Trp Gly Val 

1 A 5 



Lea Giu 
125 



Arg Phe 
140 



155 
Page 



Leu Leu 
6 6 



Leu Ser Asp 
110 

lie Arg His 

Tyr Tyr Phe 



Ala Ala Leu 
160 
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Ser Leu Asp Arg Cys Leu Leu Ala Leu Cys Pro His Trp Tyr Pro Gly 
165 170 17 5 

His Arg Pro Val Arg Leu Pro Leu Trp 7a 1 Cys Ala Gly Val Trp Val 
180 185 19C 

Leu Ala Thr Leu Phe Ser Val Pro Trp leu 7al Phe Pro Giu Ala Ala 
195 200 20. c 

Val Trp Trp Tyr Asp Leu Val He Cys Leu Asp Phe Trp Asp Ser Glu 
110 215 220 

jlu Leu Ser Leu Ara Met Leu Glu Val Leu Gly Gly Phe Leu Pro Phe 

22 5 " 2 30 235 24 0 

Leu Leu Leu Leu Val Cys His Val Leu Thr Gin Ala Thr Ala Cys Arg 
245 2 5 0 2 5 ^ 

Thr Cys His Arq Gin Gin Gin Pro Ala Ala Cys Arg Gly Phe Ala Arg 
2 60 2 65 270 

Val /via Arg Thr lie Leu Ser Ala Tyr Val Val Leu Arc Leu Pro Tyr 
275 28 0 28 5 

Gin Leu Ala Gin Leu Leu Tyr Leu Ala Phe Leu Trp Asp Val Tyr Ser 
_-90 2 95 30 0 

Gly Tyr Leu Leu Trp Glu Ala Leu Val Tyr Ser Asp Tyr Leu He Leu 
305 310 315 320 

Leu Asn Ser Cvs Leu Ser Pro Phe Leu Cys Leu Met Ala Ser Ala Asp 

32 5 330 33 5 

Leu Arg Thr Leu Leu Arg Ser Val Leu Ser Ser Phe Ala Ala Ala Leu 

34 0 34 5 35C 

Cys Glu Glu Arg Pro Gly Ser Phe Thr Pre- Thr Glu Pro Gin Thr Gin 

35 5 360 3 6 L . 

Leu Asp Ser Glu Gly Pro Thr Leu Pro Glu Pro Met Ala Glu Ala Gin 
•,70 375 380 

Ser Gin Met Asp Pro Val Ala Gin Pro Gin Val Asn Pro Thr Leu Gin 
385 390 395 -400 

Pro Arg Ser Asp Pro Thr Ala Gin Pro Gin Leu Asn Pro Thr Ala Gin 
■3 0 5 4 10 4 1 5 

Pre Gin Ser Asp Pro Thr Ala Gin Pro Glr, Leu Asn Leu Met Ala Gin 
4 2 0 4 2 5 4 3 C 

Pro Gin Ser Asp Ser Val Ala Gin Pro Gin Ala Asp Thr Asn Val Gin 
4 3 5 4 4 0 4 4 5 

Thr PrD Ala Pro Ala Ala Ser Ser Val Pro Ser Pro Cys Asp Glu Ala 
4 50 4 55 4 60 

Ser Pro Thr Pro Ser Ser His Pro Thr Pro Gly Ala Leu Glu Asp Pro 
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1 6 5 



•17 5 



4 80 



Ala Thr Pro Pro Ala 3er Glu Gly Glu Ser Pro 
4 85 -190 

O.lu Ala Ala Pro Gly Ala Gly Pro Thr 
SCO 505 



10:' 93 

ill:- 1U 3 

112, ■ DNA 

213: K . Sapiens 



Ser Ser Thr Pro Pro 
■19 5 



•ICO • 93 
atggHcacia ccatggaag 



qa 



aaee> 
oa g 
ga~t- 
t ;tg 

qoCC 



: tgacctgggt gccactggcc 

; ccaaggtggc t.gggacacgg 

a tgggttgatg gcgtggcrgg 

: gctgctcctg cteageetgg 

i gatcctagag ateeggeatg 

a ctaettecta tggggcgtgt 

t cgaccgctgo etgetggcgc 

1 gcccctctgg gtetgcgeog 

ggtettecee gaggctgccg 

■:agogaggag -tgtcg;tga 

gc:gctcgtc *_qc=ac gtgo 

gcagcccgc a gcctgccggg 

ggtcctgagg otgccetaeo 

ztactctggc taoctgctct 

cag:tgcc:: ag-jcccttc: 

■:tc:gtgct: rcgtccttcg 

gggca gcttcacg:c cactgagcca :aga^:;:ag: 

agaqc cgatgg.:aga ggeoeagt ca eaqatggate 

cacac tccagccac^ atcggatccc acagct:ag: 

a :agt eggatcccae ag:-:cagc:a cagctgaacc 

t gtqq ccc3gcco:a ggcagaca:t aacgtccaga 

grcca gtcectgtga tgaagctr.ee ccaaccrjcat 

tgaqg acccagccac acctcctgcc tctgaaggag 

Page 



^at'ji'.t gagg acicctacc 

.:i-.c:i.tggi]C tgeeagoe a 

gqaqetggea cgcgtctgq 

ggcageag eggoctte: 

agctgoot geegcttot 

g-vggc^g cccLcageet. 

g:,iccgcc eagtccgeee 

r.' cagog :gc:ctgg:t 

jtgcctgq ac:-:ctggga 

:: f ectge c:t:cc:c-:l 

;3octg':c a c c g c c a a e a 

ra':tctqt cagectarg: 

ggeeet ;;c t g t ggga e g t 

rtacct :ja tcotacneaa 

-egga cc:tgctg:3 



acaggccccg 


cacagagctt 


l 30 


ect t cctggt 


ggcectgctg 


130 


ccqgctccca 


ggecoggeat 


i. y o 


ccct ctctga 


cttcttgt tc 


2 -5 0 


ggggacactg 


g -cgctgggg 


3' 0 


cctactcctc 


eggeetcttc 


;-, i . i i 


tg-gcccaca 


ctggtaccct 


g. ii 


gtgtctgggt 


g-::ggccaca 


-I - ; 0 


tc-.ggtggta 


c gaeetggtc 


5-1 0 


ggatgctgga 


gqteetgggg 




t :acccaggc 


c-acagcctgt 




gettcgeeeg 


tgt ggceagg 


7 . "0 


agotggecca 


get get eta c 




g gqaggcc=t 


g gr :tactcc 


H -1 0 


tot gectcat 


ggeeagrgee 


a 1 1 j 


eggcagetct 


etgergaggag 


9 - 3 ) 


t agattc:,ga 


g g gtecaact 




ctgtggc :ca 


gecteaggtg 


1 0 ''"> ■ J 


c acagct ga a 


- ret: -eg j • : 


11-10 


Lcatggc:ci 


go :acag:ca 


i 2 : j :) 


cc:ctgcac: 


tgrtgccagt 


1 2 »5 :) 


c:t cg:atc: 


t aecccaggg 


132 3 


aaagccccag 


cagcaccccg 


138 0 
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ccagaggcgg ccccgggcgc aggccccacg tga 1-113 



:?.lO 
:211 
:212 
:213 



94 
PRT 

H . Saoiens 



•MOO:- 94 

Mot Asp Thr Thr Met Glu Ala Asp Leu Gly Ala Thr Gly His Arg Pro 
15 10 15 

Arg Thr Glu Leu Asp Asp Glu Asp Ser Tyr Pro Gin Gly Gly Trp Asp 

20 25 30 

Thr Val Phe Leu Val Ala Leu Leu Leu Leu Gly Leu Pro Ala Asn Gly 
3 5 4 0 4 5 

Leu Met Ala Trp Leu Ala Gly Ser Gin Ala Arg His Gly Ala Gly Thr 

5 0 5 5 60 

Arg Leu Ala Leu Leu Leu Leu Ser Leu Ala Leu Ser Asp Phe Leu Phe 
6 5 70 7 5 8 0 

Leu Ala Ala Ala Ala Phe Gin He Leu Glu He Arg His Gly Gly His 
85 90 95 

Trp Pro Leu Gly Thr Ala Ala Cys Arg Phe Tyr Tyr Phe Leu Trp Gly 
100 105 110 

Val Ser Tyr Ser Ser Gly Leu Phe Leu Leu Ala Ala Leu Ser Leu Asp 
115 120 125 

Arg Cys Leu Leu Ala Leu Cys Pro His Trp Tyr Pro Gly His Arg Pro 
13 0 13 5 14 0 

Val Arg Leu Pro Leu Trp Val Cys Ala Gly Val Trp Val Leu Ala Thr 
14 5 150 155 160 

Leu Phe Ser Val Pro Trp Leu Val Phe Pro Glu Ala Ala Val Trp Trp 
16 5 170 175 

Tyr Asp Leu Val He Cys Leu Asp Phe Trp Asp Ser Glu Glu Leu Ser 
180 185 190 

Leu Arg Met Leu Glu Val Leu Gly Gly Phe Leu Pro Phe Leu Leu Leu 
195 200 205 

Lfu Val Cys His Val Leu Thr Gin Ala Thr Ala Cys Arg Thr Cys His 
210 215 2 2 0 

Aig Gin Gin Gin Pro Ala Ala Cys Arg Gly Phe Ala Arq Val Ala Arg 

225 2 30 23 5 " 24 0 

Thr He Leu Ser Ala Tyr Val Val Leu Arg Leu Pro Tyr Gin Leu Ala 

2 4 5 2 5 0 2 5 5 
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Gin Let 



Leu Tyr 
260 



Leu Ala Phe 



Leu Trp 

2 65 



Asp Val Tyr Ser 



Gly Tyr Leu 
27 0 



Leu irp Glu Ala Leu 



Ser Asp 

280 



Tyr Leu II 



e ^eu 

2i 5 



Cys Leu 

290 

Leu Leu 

30 5 

Arc r ro 



Ser Pro Phe Leu 



Arg Ser 
Gly Ser 



295 



Leu Met Ala Ser 



Val 



Phe 
325 



Leu 

310 



Ser Ser Phe 



Thr Pro Thr Glu 



Ala Ala 
315 



Pro Gin 
3 30 



Ala Asp 
300 



Ala Leu 



Thr Gin 



Leu Arg Tnr 



Cys Glu Glu 

32 0 

Leu Asp Ser 
335 



Glu Cly 
Asp Fro 



Pro Thr 
340 



Leu Pro Glu 



Pro Met 
34 5 



Ala Glu Ala Gin 



Yal Ala Gin Pro Gin 



Val .Asn 
360 



Thr Ala Gin Pro 



Gin 

375 



Leu Gin 
365 



Leu Asn Pro Thr 



Ala Gin 
380 



Ser Gin Met 
3 50 



Pre* Arg Ser 



Pro Gin Ser 



■p J ro Thr Ala Gin Pro Gin Leu Asn 

3 90 



Leu Met 

395 



Ala Gin 



Pro Gin Ser 

400 



::p I er Val Al£ 



Gin 
405 



Pro Gin Ala Asp 



Thr Asn 
4 10 



Val Gin 



Thr Pro Ala 
4 15 



i'ro Ala Ala 



: i ••. 9 : 

21: 4 ;< 

:212: DMA 

:213 :■ Artificial Sequence 



rn i sc_ f eat are 
NoveJ Sequence 



■:4 0m: - f - 

"Cddacct i: atggaatcat ctttctcatt tggagtgatc cttgctgtc 



1 2 : ( NA 



j. . i : 



7\r t i f icial Sequence 



:22 V 



ri : s c_ feature 
Novel Seauence 



. 4 0'-* • 96 

ttcactcqdf] ttagecatea aactctgagc t.ggagatagt gacgalgtg 
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:210:- 97 
: 2 ] 1 : • 22 



:212 



UNA 

Artificial Sequence 



220 • 

:221;- rni sc_f mature 
:2 2 3.- Hovel Sequence 



q-t.caaciv'-a ctcatctatg cc 



.■12- ON A 

2 1 i.- Artificial Sequence 



li nisc_feature 
! .i • liuvel Sequence 



:t tc* ''t gcccttaccg tc 



■12- dma 

:. V Artificial Sequence 



2 2 12' fi.: sc_f eat u re 
22. i ■ !■'(■ vel Sequence 



. 4 Cm); <_,0 

jdaqcagruc cccgaatacc 



;■]!]:■ IC'O 



.1 i:- Artificial Sequence 



1 : * n.i sc_ feature 
J ; tit vel Sequence 



4 0 0 : ■ : < ■ 0 

:atg< - :tca<ic ctgagcgtca 



U0:- I L- 1 
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:n> 2 r 

112- DMA 

:!?•> Artificial Sequence 



m isc_f eature 
Hovel Sequence 

:.tcdaagc:r.t atggagtcgg ggctgctg 28 
' ~ 0 . ■ i i" ! 2 

on a 

Artificial Sequence 

' 0 . ■ 

,:J ■ mi i,c_f eature 

tJi'ivel Sequence 

10 ■ 102 

'dOtcq.uq tcagtctgca gccggttctg 30 



10 • 


10 2 




1 1 ■ 


3 0 




12 ■ 


DMA 




1 .-; ■ 


Art i 


ficial Sequence 


2 j ■ 


mice 


feature 



IJ-ivel Sequence 



30 



3<> 
DMA 

Artificial Sequence 



.'2 1 ■ nuc_feature 
22 3 ■ Novel Sequence 



; J00 • 114 

j 4t .i ;;agtgc acagataqcg gccaggatqc 



\ 10 • 105 
■: Jl ■ ■ 10 
:212 ■ DMA 

<213 ■ Artificial Sequence 
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mi sc_fea t u re 
Novel Sequence 



•1 0 0 > 1 n 5 

,ji':cccatca tctacacgc 



19 



in 6 
lfi 



2:1 j:- 



Artificial Sequence 



raj. sc_feature 

N< - vel Sequence 



•;i:<-|> 106 

acc:gti.]ca gccgctgg 



18 



2::- ntJA 

.'13- Artificial Sequence 
..2 0.- 

^'21> ni;sc_feature 

..2~o- r level Sequence 

mOU • : "7 

^" at. ciagc:. t ccatqtacaa cgqgtcgtgc tgc 33 



..' I ] : ■ 3 3 
.'12:- LUJA 

213 ■ ATtificial Sequence 



niisc feature 
Hovel Sequence 



■ICO:- 103 

"a t_ *_ ctana tcagtgccac icaacaatgt 999 



33 



210 




DNA 

Artificial Sequence 




misc teat u re 
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Novel Sequence 




1 '.' 9 

c a < ; c actgtttacc 



20 




• ji; L'tiA 

• 2 1 '.- • /artificial Sequence 



22]: mi ;-c_f eature 
22 3:- Novel Sequence 



•: .;[J(.- 1 . ( 

t ..jaaa tac (. t qtccacagcc 20 



• ;• i " : : j 
2 i i ;•■ i 
212 r.'t.'A 

• ; i '.v .Ar tificial Sequence 



0 



n : c _ f e a t u r e 
Ucvel Sequence 



• 1O0:- 111 

■ 1,, t c.agctt atgacaggtg acttcccaag tatgc 



3 5 




1 1 

3 4 

piia 

Artificial Sequence 



r;c_£ ea t ure 
I J cp v el. Sequence 



•i ••; "• : ; 2 

j. 3* c:( - tccrng gotaacqqea caaaacacaa t tec 



1 3 



DMA 

Artificial Sequence 



j. 20 

221 

223 



n v :.ic_f eat ure 
IJovel Sequence 
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■:400> 113 

.:agcccagac atccaagtc 



-■211.* 
■ :212> 
■'213> 



Artificial Sequence 



114 

19 

DNA 



misc feature 
Novel Sequence 



•-■100:- 114 

aocccactta atcagcctc 



19 



• j i 0 : • 115 

• : i 1 : 3 4 

■ 21 2:- DNA 

3> Artificial Sequence 

• 0 • 

2 1 * misc_ feature 

• 2 2 3 ' Novel Sequence 

■ A 00:- 115 

gatcgaattc gcaggagcaa tgaaaatcag gaac 34 

■ 2 1 0 • 116 

• 211 • 3« 

■ 21 2 ■ [)NA 

• 213 • Artificial Sequence 

■ 2 20 • 

• 221 nu sc_f eature 
213. ■ Novel Sequence 



4 00- ::6 



gatcgaat t c ttatatatgt tcagaaaaca aattcatgg 



3 9 



•110 



: : 7 

2t' 
DNA 

Artificia 1 Sequence 




4 0 0- 117 

". oagocccaa agccaaacac 



20 



. 10- 
.. 11.- 



1 1 s 




DNA 

Ar t i ficiai Sequence 
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:cgcaggagc aatgaaaatc ag 



* J : ■: • 119 
•:2 L .: ■ 19 
:. 1^ - DNA 

: . ■ 1 : • Artificial Sequence 
: •; 0 0 ■ 11 «■» 

c- guuaqi. tq tcgctgacc 



•:jlO-- 120 

. 1 1 21 

■: j 12 ■ D1JA 

:_1.V:- Art. if icia L Sequence 
... 2 0 ■ 

_21. ni... :.c feature 

/ . 2 1 Hovel Sequence 

• 4>:i 11C 

:i;at taU'tja cactttgacc c 



■ : . 1 121 

. i : :s 

:. 12 DNA 

1 2: Artificial Sequence 

•; ■ • • in 

i- -<it dccat g aatgagecac tagac 

■'."(.■■ 122 

-11 ■ 3,u 

2 12 DMA 

■2 12- Artificial Sequence 



2' 2 1 * riL^'.c feature 
:j2 2 * l!(vel Sequence 



: -I 0 0 1 2 2 

■ ] ■ ' ' i * ■ ." t c ( 1 1 1 q tcaaggqttq tttqaqtaac 



2 J' - 122 

:. 1 ■ 2!) 

L2 - ruiA 

<..!o • h:: i f i cia i Sequence 

<2 2u • 

<221 ■ n i :;c_ feature 

<22 3 • Novel Sequence 




P( I71SOO/315S1 



22 



1 9 



21 



25 



30 
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WO 01/36473 



PCT/l : SOU/3 1581 



-■A 00> 17.3 

<.:tgtctctct gtcctcttcc 



20 



•: J10: 
: . ' 1 1 



124 



•:.:12> DNA 

•:.":i.-i. Artificial Sequence 



■221- rr,isc_featurc 
J 2 :> ■ Novel Sequence 



• 4 0 ij:.' 12 4 

'ici'v gat j: tcattgaatt tc 22 



• Lu ■ i: 5 



DMA 

Artificial Sequence 



mi sc_f eature 
Hovel Sequence 



4 mm; 



125 




22 



• 2 1 (:■ 

• : 1 : 

■ ; i; : 

■ : l -v 



Artificial Sequence 



1 2 6 
1 t 
PTiA 



iTu sc^feature 
tJC'Vel Sequence 



«"• : 12 6 

cdcaqca taqtaqcq 



18 





2 1 



Artificial Sequence 



rr.isc feature 
Hovel Sequence 



4 0(. 



127 
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caqagctl ga tga'gaggac 



: : : : o • are 

.ni.- :■" 

::':12.- pka 

A: tif icial Sequence 

j so 

111 ■ n.. s cofeature 

3 t.'i.-vel Sequence 



ragc^a gtagtagaag 20 



F-T-.T 

Synthetic substrate peptide 



in , ?c_feature 
IJ.jvel Sequence 



A.Li Pro Arg Thr Pro Gly Gly Arg Arg 
5 

10 - 1 U) 

■:3 13 ■ Artificial Sequence 

2 21 • ini^:c_f eature 
• .. J < ■ !l -)^el Sequence 



■ icqraat icg actcactata gqgagaccgc gtgr.ctgcta gactctattt cc 



, i 0 - L 31 

111- J.) 

: ■ i)NA 

V.. 3 ■ Artificial Sequence 



:2 J L ■ "ii^c feature 
:22 i ■ Hovel Sequence 



•:4 00 > 131 

tgc^acactg atqcaactcc 
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<210> 132 

.21: as 

■:; 12:- DMA 

■:213~- Artificial Sequence 
■:nO-j:- 132 

iirqtc'iataoq actcactata gggagacctg ccacactgat gcaac:cc 



: - 133 

'.' i l > 2-4 

■Ijr- DIJA 

.'l./-:-' /artificial Sequence 



1 2 1 • nusc_f eature 
'. ■ ikjvel Sequence 



■ •? (n.»> 1 3 3 

■ i'.:<jt c t ct qc tagactctat ttcc 



i • 13^ 

11: Sf 

12-- DIJA 

:13 ■ Artificial Sequence 



■l(n:- 1 :-A 

qcqi aatucq actcactata gggagaccgc acgccactct ttactatccc 



-10- 11-5 

ill:- 



DIJA 

Artificial Sequence 



mi J'c__f eature 
[Jttvel Sequence 



H.\'K.'J 3<3a( 



/artificial Sequence 



m . :.; c _ feat ure 
liovel Sequence 



■ : 4 0 0 ■ 1 3 b 

i.|cgt jatacq actcactata gggagaccgc acaaaacaca attccataag cc 

Pace 79 



WO 01/36473 



PCT/IIS00/315X1 



-:210> 137 

•111 • Z, 

■:212:- vva 

■ : 13- A) tificial Sequence 



•221 r.i sc_f eat ure 
■22 3:- Level Sequence 



• -J 00:- ] :■" 

'icit.acgcccc tctttactat ccc 



■ 2 !(■ ■ 1 2B 

■ : : 2 : : <1 2 

■ : 22. ■ r-r.'A 

■2. 23:- Artificial Sequence 



r.i sc_f eat ure 
I.'ovel Sequence 



■■:()■:: I 

|: :c; t.,ja i CJ <:q actcactata gggagacctt atgagcagca attcatccc 



. • 1 ( • ! 

.• i : : (i 

2:12 DMA 

2113 ■ /'wtificial Sequence 



■ : 2 2 !. r< r. c feature 

■222 Novel Sequence 

• .) 2 1 .' • . •> 

..: ..iC: i-: ccduc aagaaatcag 

. 0 .10 

2:2 1. -13 

■3J22 • :)?Jrt 

■:2j.3 Artificial Sequence 
■:22') ■ 

■: 2> ■ ■ m:;c_feature 

■:22 ) ■ Novel Sequence 

;-100 • 110 

gcqt iataog actcactata gggagaccca cacccaccaa gaaatcag 



C210 - 1-11 

Page 8 0 
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WO 01/36473 



PCT/IS00/31581 



2 1 1 > 21 

.■12:- DtJA 

2" 1 3 :■• Artificial Sequence 
2 2 0 : ■ 

..22:- ;u.sc_feature 

2 2 3:- Novel Sequence 



A no:- 141 

tatqagcaq caattcatcc c 21 



2 1 0 > 1 A 2 

211- 4 9 

21 2:- DtJA 

2 13:> Artificial Sequence 
2 2 0: ■ 

223:- misc_feature 

223> Novel Sequence 

AOO:- 142 

cqtc-.atacq actcactata gggagacccg attatccaca ctttgaccc A9 



210:- 1-13 

211- 19 

212: DNA 

213:- Arn if icial Sequence 
220 • 

221. misc_feature 

223- Novel Sequence 



■ 1 , 0 0- 1 A 3 

t gaaagttq tcgctgacc 19 



210: 1-1 -1 

..: 1 1 5 0 

212 DMA 

213:- Artificial Sequence 




mioC_ feature 
Novel Sequence 



•iOO:- 14A 

ugtciateicg actcactata gggagaccct. gctgaaagtt gtcgctgarc 



50 



210:- 145 
2 1 1 : ■ 2 1 



212:- DMA 

213:- Artificial Sequence 
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■220. > 

2; l: misc f eature 
Z7 3 ■ Novel Sequence 



■ ■iic-o:- :-i2 

;.a<-, f. at oca cactttgacc c 



21 



: : 0 : 

■: i : •• 5C 



3:- Art if xcial Sequence 



n.LS cofeature 
Nc vol Seauence 



■-ft 



( : ( • 1M 

qi ,-aLaai actcac:a:a gggagaccct gtaaaattca cacaagcacc 



; 1 (■■ 
2 1 ; 



di:a 

A.: t 1 ficial Sequence 



■ ( 1 



m.: :-c_feature 
tk^vei Sequence 



« M,i- iacti' i £ "i ccaacctcc 



:j - 1 ;c 

: 2 : • 4 k 

2 12 - DMA 

:2l3 Arvificial Sequence 



22 ~ ni:;c_feature 
22 i ■ N r/ol Sequence 



- : I ) 1 ■ 1 ■; H 

JqC'jt aat-'n: gactcactat agggaqacca gnagacagag caacctcc 



48 



:2 i 2 DMA 

:2li- Artificial Sequence 



:220 • 

<221-- misc feature 
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223> Novel Sequence 



4 C 0 : ■ 14 9 

t.ntaaaatt cacacaagca cc 



210:- 


If 0 


21 1> 


3 1 




L-liA 


2 i 3 > 


Ar t 


2 2 0 • 





^21 ■ nasc_f eature 
2 2 3-- Novel Sequence 



■-,('■(■■ 150 

catagatrc tctttgctgt atttcaccct c 



210 151 

211- 31 

212 ■ l:UA 

213 Ai.tificial Sequence 



■;oo ■ im 

i- (J t aaati c acaatgccag tgataaggaa g 



22 1:- ru sc_f eature 
.'23: Ih.ivel Sequence 



i.itcaagct ggaatgatqc ccttttgcca c 



' I C - 1S3 

2 11- 2 

212- DriA 

213:- Artificial Sequence 




mi sc_f eature 
Novel Sequence 



:.•■<: 1 . ! 2 

j ; i . 3 i 



2 12: I )I JA 

^13> Artificial Sequence 




ni.sc feature 
N^vel Sequence 
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• • 153 

j cct cgag catcattcaa agtaggtgg 29 

2:0 • is-; 

. : : i • 4 :■ 

:: '..2 ■ dija 

.: : i • Artificial Sequence 

.::.0 • 

1'- m:. sc_f eature 

.'."3.- Novel Sequence 

■:4mO . ■ If- 4 

I.;- critcqct] ctatqaactc aattccaaaa ataatttaca cc 42 



pna 

Artificial Sequence 



rr,:. :.ic feature 
tlC'Tei Sequence 



: . .1 1 ■ 4 H 

ll.-. ■ /•.rtificiaJ Sequence 



r.:i. ac_teaL ure 
Novel Sequence 



1 n * 6 

: gaaaa dtgeataaac cattggatta aatqt agagt tcaagtagc 



' — .i n 
1 



UNA 

Artificial Sequence 



mi :~c- _ f eat u re 
Novel Sequence 



■:4')'J • 157 

ga*:cgaattc atgqacacfa ccatggaagc tqacc 35 
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■:210> 158 

... II • 31 

■:212:.- DNA 

• _ 1 J . • Artificial Sequence 
• : : 2 0 : • 

-:22i:- mj.sc_ feature 

■'2 23:- Novel Sequence 

■MOO:- 158 

g^tcctcgag tcacgtgggg cctgcgcccg g 31 



■VIU> 159 

• . 1 1 • 5 2 

• 12 DMA 

■ 2 1°.: Artificial Sequence 



m i. r.c_ feature 
U'.ivel Sequence 



• j u 1 1 ; ■ 1 5 H 

gcqtaatr.ccj actcactata gggagaccgc gtgtctgcta gactctattt cc 



52 



1 ■■ ' 



1" ("i 



■ : i 



Art if icial Sequence 



m± s c_f ea t ure 
Novel Sequence 



' gccacan.c; atgcaactcc 



2C 



■ . ■ 1 (.-■ 
.11 
.12 

• 2 1 



4 H 
Lit J A 

Ai t i f i c i a 1 Sequence 



rn_ <-c feature 
tk-vel Sequence 



■) in •: • "i b 1 

Kgtaatacg actcactata gggagacctg ccacactgat gcaactcc 



111:- 



1 62 

2 4 
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:212- DrJA 

• Artificial Sequence 

22 o .* 

22 1 ■ rn L3C_f eature 
22 • Novel Sequence 

■h .1 '"I * ln2 

■3.>r.'jtct'.jc tagactctat ttcc 2 4 

■:jU>.- 1 o2 

■211 • ISO 

:)..:■ UNA 

]!•'■■ Artificial Sequence 

■'22: ■ rr. i.^c feature 
■:. .' V Novel Sequence 

A 0 - lo2 

qc'it.iatacc actcactata gggagaccgc acgccactct ttactatccc 50 
2 ](.':■ 

: ^ 1 1 - 2 4 
■:. 1 2: - [>UA 

■:21 •'■ * Artificial Sequence 

']:• ruLC_feature 

• 2 2 '.: •• Novel Sequence 

• j.-u:- In-; 

..l.-.icjaaai^i caattccata agcc 24 

... 1 (.'::■ 

• 2 11.- 

'2 12:- L J I J A 

2 1 • Artificial Sequence 

. 0: ■ 

: 1 : • ri i s c_f eature 

Iiir;ei Sequence 

■}■)' . in-;. 

.jcg*" Tiat acq actcactata gggagaccgc acaaaacaca attcoataag cc 52 



I!''' 1 <M- 

: 2 1 1 : 2 3 

:212 - DNA 

::213 ■ Artificial Sequence 



Paae 86 





WO 01/3t»473 



PCT/tSlMl/31581 



220 



rrisc_f eature 
Novel Sequence 



100: 1 (-, 6 

■:tacgccuc tctttactat ccc 



23 



210 



1 67 



.23:- Artificial Sequence 
;'■():■■ 

111.- nasc_feature 

Novel Sequence 

■100: 167 

. tr.aatacq actcactata gggagacctt atgagcagca attcatccc 



j 10 • 1 08 

ill:- _ i, 

212:- nil A 

21 i ■ Artificial Sequence 
.-::0:- 

222 ■ misc_f eature 

2 23'-- Hovel Sequence 



.•,00:- 168 

-xarccacc aagaaatcag 



.:!0- 169 

ill'- h8 

212 • DNA 

."J I 3 ■ Artificial Sequence 

2.20 ■ 

221 ■ n_sc feature 

223 ■ Novel Sequence 



■100 169 

i--gt.iatacg actcactata gggagaccca cacccaccaa qaaatcag 



210:- 170 

ill: 21 

212> DNA 

213:- Artificial Sequence 
220 ■ 

• 221'- misc_f eat ure 

• 223.- Novel Sequence 
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.AOr, . 170 

rtj^qagcaq caattcatcc 



M : 1 - 1 V I 

M : - 4 9 



DMA 

Artificial Sequence 



.J . ' ] • m L r. c_ feature 

Novel Sequence 



tot". <ia t acq acLcactata ggqagacccg attatccaca ctttgaccc 



4 9 



19 
DMA 

Artificial Sequence 



n : t c_± eat ure 
Novel Sequence 



1 2 

: ac1 t cj tcqctqacc 



19 



DM/, 

Art ificial Sequence 



. 1 : ■ n .:, e c_ feature 

Novel Sequence 



V.'g'.Mdtacq acrxactata qgyagaccct gctqaaagtt gtcgctgacc 



1 



DNA 

Artificial Seauence 



: 1 : rn : :^c_f ea ture 
"* Novel Sequence 



4 0 'J ■ 174 
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•jattatcca cactttgacc c 



.no 

211 



HA: 



175 

50 

ONA 

Artificial Sequence 



2 20> 

221-- irusc_feature 
j_'3;- Novel Sequence 

A ( 1 0 : - 1 "7 5 

cqtaataog actcactata gggagaccct gtaaaattca cacaaqcacc 50 

.10- 1^6 
. ' 1 ] • 19 

diia 

..'13.- Artificial Sequence 
2 20 • 

221'- misc_f eature 
.'.";■>. Hovel Sequence 

A i.'Ci • 1^6 

naaqacaga gcaacctcc 19 

^10- IV 7 

L. i ] - " 17 
DIIA 

j 1 3 ■* Artificial Sequence 
s/:0 ■ 

:.'.']:- misc_feature 

L'1.3-- Novel Sequence 

<;00> 177 

cut Matacg ac:cactata gggagaccaq aaqacagagc aacctcc 47 



112-- DI J A 
13.- Artificial Sequence 

msc_foature 
22'A.- Hovel Sequence 

:<I'i0 ■ 178 

:r:jtaaaatt cacacaaqca cc 22 
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:210- 17 H 

■ 31 

:212 • L »M7i 

..13 . ■ Artificial Sequence 



ni:.i. : cofeature 
Uovel Sequence 



jua*. ggat c:c tctttgctgt atttcaccct c 



• :' ; u: ■ : £(■ 

• : : l 

• 2 12: LI! A 

• 2 : i Artificial Sequence 



: I : ■ rii i.-c_f eature 

Novel Sequence 



■;0n: Iff' 

ji ; tJ * qaat. t. c acaatgccag tgataaggaa g 



2 10' 181 

: ; 1 : 2 0 

212 ■ ON A 

.'l.-: Artificial Sequence 



•2"2i:* n .:. sc_ feature 

•223:- Novel Sequence 

-Kin:. ] r 1 
^':di.:t:ccccfl agecaa a ca c 

3 1 l-:2 

•211: 22 

. 1 . UNA 

■:2 11V- Artificial Sequence 



. 2 2 ! : • rnri_;c_ feature 

■:22.V Novel Sequence 

■:«U\> :R2 

.:cg .:. j ggagc aatgaaaat: ag 

2l0 ■ ri3 

:21 1 1. :) 

;212 ■ DNA 




PC I7USOO/315N1 



31 



31 



20 



22 



Tage 90 
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- 213> Artificial Sequence 

■ 220 ■■ 

■ 21:1.- mi sc_f eat ure 
•22 3- Novel Sequence 



■ 400 ■• 183 

ctgtctctct gtcctcttcc 



■ 210. • 18 4 

•■.11 - 22 

• 212 - DtIA 

■213- Artificial Sequence 

• 220 • 

• 221 • n;.sc Jeature 
.23- Novel Sequence 



• 4 00 • 18 4 

a:ac-'gatct: tcattgaatt tc 



2 1 0 > 18-S 

211:- 1188 

212:- DNA 

21 3- H . Sapiens 

400- 185 



aggctcgoge 


ccgaagcaga 


gecat gagaa 


ccccagggtg 


ectggegage 


cgctagcgcc 


h0 


.itgggccceg 


gegaggeget 


get ggcgggt 


ct cct ggt ga 


t ggt actggc 


cgtggcgctg 


120 


ct -r.ccajcu 


cactggt get 


gctttgttgc 


gcctacagcg 


ct gagct ceg 


cact egagee 


13 0 


teaggcgtcc 


tc ctggtgaa 


tctgtctctg 


ggccacct gc 


tgctggcggc 


gctggacatg 


240 


cccttcacgc 


tget cggtgt 


qa tqcgcggg 


cggacaccgt 


eggegecegg 


cgcat gecaa 


30 0 


jt cittg^c- 


t cc t ggacac 


cttcctggcg 


t ccaacgcgg 


cgc t g agegt 


ggcggcgctg 


3 60 


agcgcagacc 


agt ggct ggc 


agtggqctt c 


ccacc geget 


acgccggacg 


cctgcgaceq 


4 20 


eg -tatgecg 


gect get get 


gggctgtgcc 


t ggggacagt 


eg :tggcct t 


ctcaggcgct 


48 0 


gca r t-. tggct 


gctcgtqgcL 


t ggctacagc 


agegcett eg 


egtcctgttc 


gctge;gcctg 


5 4 0 


c ;gcccgag : 


ctgagcgtcc 


gcgcttcgca 


qccttcaccg 


ceacge t eca 


tgeegt ggge 


bOO 


tccgtgctg: 


cget ggegg t 


gctctgcct c 


acc t cgctcc 


a g g t g c a c c g 


ggtggcacgc 


6 60 


agacact gec 


a g cgea t qga 


cacegtcacc 


a tgaaggege 


tcgcgctgct 


cgccgacct g 


7 20 


c accccaqtg 


t gcggcagcg 


ctgect cat c 


cagcagaagc 


ggcgccgcca 


ccgcgccacc 


780 


a^gaaaattg 


geattgetar. 


t gegacett c 


ct cat ctget 


t.tqccccgta 


tgtcatgacc 


840 
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■ - 2 1 0 
•:211 
■:212 
■:213 


• 1 y 6 

• 303 
PPT 

H . Sapiens 


•v.00 


• lb 6 




Met 


:;i y 


Pro 


Giy 


Ala 


Va L 


/■.la 


Leu 

20 


. ^ e r 


Ala 


C.iu 


Leu 


Ser 


Leu 


01 y 


Hir. 


Lou 
6 5 




Val 


Met 


Val 


:: i e 


Gly 


Phe 


Val 


Ala 


A! a 


Leu 
100 


Arg 


:yr 


Ala 
j 15 


Gly 


Cys 


Ala 

130 


Irp 


Gly 


Ser 
1 A 5 


Trp 


Leu 


Gly 


Pro 


Fro 


c; lu 


Pro 


H.LS 


Ala 


Val 


G 1 y 
180 


Leu 


Gin 


Val 

195 


His 



Met Gly Pro Gly Glu Ala Leu Leu ALa Gly Leu Leu Val Met Val Leu 

5 10 15 

Va L /:1a Leu Leu Ser Asn Ala Leu Val Leu Leu Cys Cys Ala Tyr 



Thr Arg Ala Ser Gly Val Leu Leu Val Asn Leu 
4 0 4 5 

;r Leu Gly His Leu Leu Leu Ala ALa Leu Asp Met Pro Phe Thr Leu 
,0 5 5 60 

Leu Gl/ Val Met Arg Gly Arg Thr Pro Ser Ala Pro Gly Ala Cys Gin 
7 0 7 5 3 0 

V,L -1^ Glv Phe Leu Asp Thr Phe Leu Ala Ser Asn Ala Ala Leu Ser 

90 9 5 

il Ala A 1 , a Leu Ser Ala Asp Gin Trp Leu Ala Val Gly Phe Pro Leu 

105 110 

Ua Glv Arg Leu Arg Pro Arg Tyr Ala Gly Leu Leu Leu Gly 
12C 125 

Ser Leu Ala Pne Ser Gly Ala Ala Leu Gly Cys 
135 140 

Ser Ser Ala Pne Ala Ser Cys Ser Leu Arg Leu 
150 155 160 

;iu Pro Glu Arg Pro Arq Phe Ala Ala Phe Thr Ala Thr Leu 
165 " 170 175 

Ala Val Gly Phe Val Leu Pro Leu Ala Val Leu Cys Leu Thr Ser 

185 190 

L^u Gin Val His Arg Val Ala Arg Arg His Cys Gin Arg Met Asp Thr 

200 205 
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90 0 
9 60 
02 0 



aggctggcgg agctcgtgcc cttcgtcacc gtgaacgccc agtggggcau cctcagcaaq 

rgcctgacct acagcaaggc ggtggccgac ccgttcacgt act ct c t get ccgccggccg 

ttccgccaag tcctggccgg catggtgcac egg^tgetga agagaacccc gcgcccagca 

tccacccatg acagctctct ggatgtggcc ggcatggtgc accagctgct gaagagaacc 10B0 

ccqcqcccag cgtccaccca caacggctct gtggacacag agaatgattc ctqcctgcag 1140 

cagacacact gagggectgg cagggctcat cgc:cccacc ttctaaga 



1188 



BNSL'i'X":ip -:WO ■:l.Mi4,' .iks I 



WO (» 1/36473 



PC T/l SOU/31581 



Val Thr Met Lys Ala Leu Ala Leu Leu Ala Asp Leu His Pro Ser Val 
2:0 215 220 

£rq G 1 n Arq Cys Leu lie Gin Gin Lys Arg Arg Arg His Arg Ala Thr 
225 230 235 240 

t-cc t ■ / s Tie Gly He Ala He Ala Thr Phe Leu lie Cys Phe Ala Pro 
245 250 255 

l7 v v/T Met Thr Arg Leu Ala Glu Leu Val Pro Phe Val Thr Val Asn 

260 265 270 

Ala Gn Trp Gl y He Leu Ser Lys Cys Leu Thr Tyr Ser Lys Ala Val 
275 280 285 

Ala Asp Pro Phe Thr Tyr Ser Leu Leu Arg Arg Pro Phe Arg Gin Val 
;;90 295 300 

L-u A L a Glv Met Val His Arg Leu Leu Lys Arg Thr Pro Arg Pro Ala 

. ■ 315 320 



20 5 



310 



er Thr 


His 


Asp 


Ser Ser Leu 








325 


eu Lys 


Arg 


Thr 


Pro Arg Pro 






3 '10 




h r Glu 


Asn 


Asp 


Ser Cys Leu 




2 55 






210: 


187 






ill 


2 9 








ei;a 






213:- 


Artificial Sequence 


220. ■ 








221> 


rnisc 


feature 


2 2 3:- 


Novel Sequence 




5 87 







330 



335 



345 



3 50 



360 



::ataagctt gccatgggcc ccggcgagg 



29 



210; 


1 H8 


2 11. 


2 8 


212:- 


L'NA 


: 2 1 3 : ■ 


Artificial Sequence 


:220> 




:2 2 1 : 


mlsc feature 


:223- 


Hovel Sequence 



■:4 00> 188 

jcattctagci cctcagtgtg tctqctgc 



28 



H0> 189 
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<211> 2 0 
■■212:- DNA 

ArtificiaJ Sequence 

■::\20-- 

■ ::21> n.isc_feature 
•:22>;> Novel Sequence 

A00> 189 
v qct .cict t ttgcgcctac 

• : : 1 DTJ A 

13> Artificial Sequence 

■:.'-?('):• 

:221> misc_f eature 
;22.i:- Novel Sequence 

;400> 1 JO 

ttgqdcqcca qgaaggtg 
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pcr/ustmyjissi 



SEQUENCE LISTING 



<110- PhsnnACi* £ Upjohn Company 
Vrjgr] i , Gabriel 

Huff, Tiiti 

?e-j J i L'.2 i TOfEC 

r-lightojri r Jejrry 
E:chtULn r Kathleen 
tarinigan, Chxic 
Ruff, Valeria 
Kayte-s 1 , P-avl 
Wood, Li nil;? 
P?,rc-r'i, Luis 
Hiebtfhj T:r<n£ Id 

<120j NovkI G note in Coupled Receptors 

<12u> 0d2rlPHfcM29G 

<150^ CO/1 t-:3fl 
19 3*- 11-16 

<"i SO; 60/1 96, £.6£: 
<1S1> 10 00-04-1:0 

<1S0> £0 /I EG.. 07 i 

<i*o> t.o/i t«f-~.. Fj'-'K; 

<1?-1> 3 9S-<i-l 1 -1* 

<ZLSO> £0/173, 3*6 
<1-.L> 19* j- 12 -IE: 

<1^0:< 60/1*4 , II? 

ZOOO-OZ-UZ 

vilLOv 60/3 35,42} 

<1^.0> E>0/1£6 ( f j30 
<1S1> 2 0 00-0 3- 01* 

<ISO> rip/l£e: r 3 13 
<"i.Sl> 2 IJ OO- 0 3-0 J. 

<l"-.0> bU/lB£: r LI4 
<1ZI> 2 0C'O-O3-rj? 

<15Q> GQ/l&O r 310 
<151-* 2000-03- "I 

< 15 0:- 90, KO'li 

■as:i> 2 00 0-03-21 



P;.i.r?c I 



BNSlXJGlD --WC) Q}M'A7'HAJT\ 



WO IJ 1/36473 



pCiYustwyjiNti 



<:LE.O> 50/ 201, 190 
<L51> 2000-05-02 

<15D> CP/ 2 03, 111 

<].^r>> m 1/20 ;r , fj'Jd 
<1B1> i'L.i5-Ofi-25 

^170:- latent In version .3.0 

^2L0:- 1 
^Ll: 1182 
="2L2> R« 

J I J Q > 1 

Gt.ot^ggq^t ggqg^atgct gggacfigggc? t-^aattgcct ga&gca*gtg ctctc-stccc GO 

cct arjctcct gctgatnt-^g tt ggijgctir-r: .:vjflqtq'j(]uci. y vi-.\:*r.i:<jiivvc- L? fl 

t tctctqc re t tacrqlriL t ^CfOOaU^arf .MLCtcagfrtc- qac ttacitoa ^gat^tjjeca lfcu 

cg-anut tt r/o ut^y't"'; (% t ■- t ctgqgccaca £tt cctqgc c ga g^,ga£. a tj£. gg^gr^aat^e 2d0 

ijtj)Ly.n_i.-:.^t;r-: ttcttcactc ct Bg ggeca t gtggtsgagc tgc=.gtcgcs cctncttctg 300 

ticaatagqca tagatgagtg ggtt gagman ngagt tgece ic^c-cg'^.jco. gconcaggta 360 

ccq Itcc-jrc acta nrj tana gg t-jtiuz-jot r- •;tmiO->?n]Ci'- .ii-r-tyi:^'':;"^ t p." i^tjt I.. J 20 

aag ga.j ggr'i-j gt^c^-jn^t « q^gsjoaagcif. occaatLgaQ.a a cagacac rig tacge.^ga t,r: 0h!'< 

ttt^r'^'J 1 r -*M L'lt'jeft-^gt-rc gtgqgoatcg itaacctcca gccitypctc ctqcatgttc LIO 

o*i.c?r.tt.ro& dtctrjctgqc tgtqcatgga gqcaatcttg ageatgtege agtagaagaa; fidO 

LS^ddiq.dug agcatggctg ggaagaaqDC -j-acgesggag dqngtc^gc£> cg-sart garg CG0 

gtgaaatacca gc.aaagaagc tgc-actgccu tttgtci-jgca 'I t ct^L't 'itjr ar.TatPSTn.t "'^O 

t GCga^tgrg a.g'JilB.iJCCJy kij»>jqt:.&&qA O.iCt&A^nde dgf^CCfg'-a A tQ^H&QOfiKir: 7lji1 

icrgc-CQc ""O^ixl ■■:.^T-^ei L crr.T; r-a^gt a g^-g yaaqggc t yctt c, at gc.- taeg^t.^tLit: a*u 

r7t.!.':y.ri^-j4t q -itca^dT/jd ccgtgaggac .sgaggcagct qcggaggaag tgacaa^t^c 3 DO 

Liftt.ccg^ii^g ctqcacaggg tctt.ctgt:gt gggccgagaa ggrjet i7.n0 g^ gctggtctct 360 

gagtaggecs gagatqgeca cQ^aat caa 'igt ^toancc ara'jtwi-it tc^nn^t gc? 102'"' 

gca gag j ct g ac-^cw t t t r: l_ I ■> 1 4 q f t C p t':.^gC h Q-:- } ct: a*r 3 q c:t ^ t gt. 1 1 J A 1 1 

nttaqt.in;. %j s.tq m gq, Lj dggLLaggac HyC3aggatc actccaaatg agaaagatga LliO 

tt r:c.^tgi'- r :l' ugaagtQqca ggacttcact taccagggca tg 1132 



wo pcr/uauo/jistj 



<21u> 2 

<211> 335 

<212-> PPT 

<213> }J . Sapiens 

<4 00> 2 

r;j u £ e , r .qer Phe &ei Phe Gly Val lie Leu Ala Val Leu Ala 3ex 
1. ' ' ' ' G 10 15 

Lftii IJe 71* Ala Thx Asn Thx Leu Val Ala Vel Ala Val Leu Leu Leu 
' 20 21 30 

lie hi^s Lye Ren A£i> Gly Vsl 3er leu Cys Ph* Thr L<?u Asn Len Ala 
3,5 ' 4 0 

Val Ala As^ Thr Leu lie Gly Vb I A I lie r?er fily T.^u L*u Thx A.-^. 
bO 5-S 60 

GIp Tyf-n E^i- Pro Shi Arq Pro Thr Gin Lya Thr Leu Cys Sex Leu 

fj 5 70 " 75 HO 

Arg Met Ala Phe Val Thx Ser Ser A J a Ala Ala £<?r V*l Leu Thx V#L 
3 5 30 95 

Wet l.etu lie Thr Phe Asp Arg Tyr Al* 7 1.0 Lv* ^ n p - >: -' P* 1 * 7 ' i: "<J 

100 ILQ 

Tvr L&u Lvs T1l: Mst Ser Ely Phe Val Ale iSly Ala tya lie Ala Gly 
lis 120 125 

Leu Trp Leu val Ser Tyr Lou II* Gly Ptic L^u Prv t,-u Gly 7 lo pro 
130 135 Ull 

Wet Phe Gin Gin Thr Ala Tyr Lys Gly Gin Cye ser Phe Fhe Ala Val 
145 ISO l- r >^ 160 

Phe Wis Pre Hi* Flvs V,^l L^u Thr Ltru Eex Cys Val GLy Phe Phe Fro 
]hh 170 1 7 !j 

AId Wot Leu T.etu Fhf: Vfil Fhe Phe Tyr Cye Asp Met Leu Lya He Ala 
130 13 5 150 

K&r Hftt Hie Ser Gin Gin He Arg Lys Met Glu His Aln Gly ALj Met 
195 20 0 #05 

Ala Gly Gly Tyr Am E^r Pre- Arq Thr Pro ^r 1 Asp Phe- Lya Ale Leu 
210 ' 21* 220 

Arn Thr Y.il r^.r Val L-eu He Gly Sex Phe Ala Uu £Jsr Trp Thr I J io 
225 235 240 

Phr- l.^u He- Thx Gly He Val Gin Ala Cys Gin Clu Cys His L^i.> 

2 4 5 2 5 0 > 5 5 

Tyx Leu Val Leu Clu Arg Tvr Lvu. Trp T..^..j Leg Gly VaI Gly Aari Ser 
260 ' 270 
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Ty^i- I.r 'j A c ;n Pro Leu lie; Tyx Ala 'J'yjc Trp "jItt. Lya Glu Val. Arg Leu 
2/S 230 

GIt, Leu Tyr Hie Met Ala Lou Cly Val Lyc Lys Val Lou. Tlir Phc 
2£«0 2 95 -3 00 

Lai- Lp.j Phe Leu ftJ.a Ar<4 Asn Cyi Ely Pro GL> Arg Pro Arg Glu 

c^r 5nr Ty-, JH i lift val Tbr lie Eer Sex Ser Glu Phe Asp Gly 

<Zh 230 23b 

<213-' Sapiens 
<: -1 C -0 J 

cagcgcgagc gcct tost >*7 t,ii*<xMtgL<! cate^ctgg cagr_gtetc;<-; gtg*- cm 

ntnc^f.-ctn-i ^.jt.rq^cgtga ggc&gagcsc cgccagcggc sgcacgaagc ccacg^catg 

r;-.iLic-3tg.^v] Qi.c^c.qcto cgaagcgcgg acgctcaggc tcsggcggca ggcgcagcgs 

^•^u.jrt^.jLi^ sacgcuctgc tgtagccaag cc^cgagcag ccaagtgcag cgcctcagaa 

a^ccdi^cgac tgtccccacrg cacsgcccag cogcaggccg g cat agency tytcgcaggccj 

t=eqgcgt;ag cgcagt gggo. .mvctavlQO vvqooAGtvq Lcty^tlca gtgoi^'-^at': HtfO 

g^tcapc-icc n.-nLti^qiic-g ^nao-gaaggt c ; t ecaggaag ccaatgactt ggcatgcgcc 420 

n.jijt-.^Lio^rt^ ggtgtccgcc cgcqtdtcac accqagcdgc gt caaggric* tgt ccag cgc 

cqccascagc sggtqnccca gnc^ong.ottt cseca-jgegrj .-'0'>c; tq^t^ ^t-v^ar;t<j^ij 

Cdrticcdtc soc^rirag^; oocifiCdaoaq t;^ti^l;v:gr;<:g yg c;ccr:6t gg cgcLagA 

<:211.^ 21? 
* 2 1 2 *■> PRT 

•: 2 1 3 > H. „ £ ap i ©ri a 

c :er Hd Met <^lv Pxr- niy Glu AJ-i T.^u L*u A!y nly Leu Leu Val Met 
1 5 10 1B 

Val L~u Aln V.*l T^.ia T.*u L*u ssr Jssri Ala Leu Val Leu Leu Cye Cye 

o, 2 5 3 0 

r,^^ Ty>i f»er Ala Glu Leu Arg Thr Ajg Aia Ser Cly Val leu L^l> Val 
3 5 "40 45 

Aen Leu Ser Leu: Gly His l-*..> T.^,.-. T^u Al* Ltau Asp Met Pre? PA* 

SO ^ 60 

Pace -h 



120 
130 
2-0 0 
300 



4,3 0 

f-0 0 
e>5 7 



WO 01/3*473 PC'lYUSCWmsSI 



Thr Leu Leu Glv V-ni M^t a^Q Gly Th* Pr» ^rr Ala Pro uly Ala 

G5 711 15 30 

nj n v,:.] Tie Glv Fhe Leu Asp Thr Phe Leu Ala 5 ex Aan Ala Ala 
SB 90 ?5 

Le.u Eer Val Ala Ala L*u Set Ala Asp Gin Trp Leu Ala Val Gly Phci 
10 0 105 110 

Pro Leu Axq Tyx ALa Gly Ary Leu /'■xij Pro Acs Tyf Ala Gly l-eu L^u 
115 " 1*5- 

Leu Cly Cys Ala Tip Sly Gin f?er Leu his Phe Sex Gly Ala Ala L«u 
130 1^ 

Gly Cys S'-r Trp L*ii Gly Tyr Sex Ser. Ala Phe Ala Set Cys &ex Leu 
iJ.S 150 155 1G0 

Atq Leu Fx* Pro Glij l>lo Clu Artj Pxc Axq Phe Ala Alcv Fhc Thx Alt* 
165 ' 170 1">5 

Thx Leu Hlfl ALa Val Gly Phe Val L«u Pro L^v Ale< Val T.ou L«u 
LB0 

Thx Ser Lgu Gin v# l. Atg Val Ala Axg Arq His Cys Gin Arg Met 

Tlir Vrt 1 Thx Wet L-ys A1b Leu Ala 
215 

<21Q> 5 
•-.211 > *22 
<212> DNA 
<213> K , Sapiens 







tcctttgtac 


a t ccct ca;ca 


cgctgttcga 


a t qgcstt tt 


6 0 




tctgtctatt 


ttggct cact 


actqactatc 


trjttntgtnc 






tataacattg 


tcctcatcog 


ct □ tgn t 




Lol.i::a;irt L.qt;: 


I,. L.L/iyy L.t.qa 




aca Cist t a -J c 




!..« y'rfi L. til 


tcjtaaatgta 


t3 




222 



-210: 

■T2 1 1 : 
=:2 1 1:: 
■-Z I os. 



PPT 



cya Ala sly VpI IIct Sr-r 11* P^C? "^^^ Tyr lie: Pic Hie Thr L-eu l?he 
L ^ 10 15 

G 1 l> Trp Aay> Phe Ely Lya Glu lie Cys Val Phe Trp Leu Thr Thr Asp 
20 2 5 -3 0 

pjyo 5 
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T'yr Leu Leu Cys Thx Ala S^r Vol Tyr Asn He Val Leu lie Sex Tyr 
A-=Lp M-q Tyr L-ftu Ser Val Str Asn Ala VaJ 5er Ar7 THr Hi!? Phe He 

BO 55 *;o* 

fro li&u Arq Arq Leu Cya Lys Cys He 
65 " 10 

<210> 7 
<31).> 50"? 

< 2 1 ' • > }3 , f? 0* p X ft rl £ : 

<JC!u> 7 

gacgtcc-aag caggtgatga tgcccagggc gtgcaccggg taggtgegAf, ogqtg its." 



ISO 



300 
360 

4 BO 



cagcgg^gac sgggcggtca qgagcaqcaq ccaggtccct gcacacgcgg ccacc-gcqLa 

acgatxrcicgg cgccagcgct tgg^gctgag c^ggtacagg atccccagga agcgctccac 

gctneitacag stcatggtga g-iatgctgnw atacsitgttt gcgfcaaaa^g ccacggtcac 24 f\ 

tccTLtticao ci'jOAO'.-^or; ogas+,3CiOOA gto.-jt^cgq ttgcaatgct agtagatttg 

q.4fl3g<3^!*«/: a- :^L:l.^yL->:-tJ eic a tvi a 1 1= tic* U-aw^e L-O *ggt trjat ca tgaagatgac 

c-caiiggggat. i-;tggc.eccca tgcc»-!ccjgf a criageacrccao a L^'jan not t noccyg 

gatgct gac2- gccgccacca gcgagtacac cacgggcagg gccarcgcga tcg^Lgggi-U 
ccgcsgcatc tgcancgtcg cgttgtc 5u ; 

^ 1 1) > ft 
'.-iii> icy 
OL2> rsvr 

<2 13> H- Sapiens 

Aip flsn Tin- T.-uj Gin Leu A*'*} ftsn Fro Ala I1b Ala Val Ala 

h 10 15 

l_^u Pro V3l VaL Tyr ser VaL Ala Ala Val Ser lie? Pro Giy F'.^n 

20 25 

Leu Phe Sex Leu Trp Val Leu Cys Arq Arg Met Oily no Arc: Se-r Pr^ 

7 r, .JO 4 5 

S-sr Val lie Phy M«t He Asn Leu Scr V.>1 Thr Asp Leu Wet Leu Ala 
5 0 55 

Y.;W T.-^u Prrt Pht Gin Tie Tyr- Tyr ni.s A$r> Ai q Mi* Trp 

Val Phe Gly Vsl Leu CyB Aan Leu Vat Val Thr Val Ala Fhe Tyr Ala 
Sb 90 

Facie & 



WO U 1/36473 PtT/LlStmmHtl 

Aer Met Tyr Lex S«r lie Leu Thr Met Thr C/o rirt Hrtir V*l GlkJ Arc] 

Phe Leu Giy 11« Le.L T'vr Pro I-Erui itr Sex Lye Ax-tf Tip Arc; Ai-d Arn 
115 i20 125 

P,rq T Y r Ala Val Ala Aio. Cys Aia Gly Tbx Trp Leu, Leu L.eru Lku Thr 
130 135 1 d 0 

Alfi Leu Pry ton rM 9 * 1 0 Thr Asp Leu Thx Tyr Pro val His Aid 

Id 5 ISO 155 leu 

Le-j Cly lie i-le Ihx tiys Phe Asp Vsl 
1G5 

<21L> S7(i 
< 2 1 2 > DNft 
■::213> H. Sapiens 

<400> y 

cccatgttcc tgctcct ggg eaycctcacg r.r.gtcrQfg^t ^ tij^tijy^agg cgccqcctac 6 0 

gcc^ccaa.ca tcctacCgtc ggggccgctc dcgctgaaaL tyLci^cc^c get c-t gqt t c 12 0 

goanggg^g-cr gacjgcgt ctt cgt cjgcactc actgcgt c " g t get g age-i^t. ciit^-iqrrc-t d 180 

p.:;tgL:l r .^ijfi.:]r. q r^;^rj K n,^ n c^fc ntjcwjcnjc n gj^g gecog cgcccgt ct c Ca St^g y C g.^ Z4 0 

C g C a ■ : g 1 1 y g g £ t. y £ C a g <T C O U L? ■: : \;. t. ij i: : !•-! 7 I.) 

^2L0> 10 

<2LI> SO 

-2L2> PfcT 

-2L3> H,5apierj3 

^ 4 Ci n > 1 0 

Ft n Mel. Phe T>*tj 7,0 i.j L^v GJ v Sex L*u Thr Leu Sejf Asp l*u L*u Ala 
1 5 "JO 15 

Gly Ala Ale Tyr Ale Ala Aeti lie Leu Itu .^i Giy Pro Lc-v T]ir Leu 
2 0 2 5 

l.y* Leu -Sex Pre Ale* L*u Trp Fhe Ala Ar^ Giu Giy -£ly Vsl Ph^ Val 
3t AO $5 

Aia LtfiJ Thr Alrt Str v*l L*u f;^r l.eu T.,ni.? Giy lie Ala Lea Glu Arg 

55 e-o 

S&r Thr Met Ala Arq Arg Gly Pro Ala Pro Val Kkj: Srr Cly 

65 70 " 7 5 HO 

Arc? Thr L&u Alp Met Al* Ala Ala Ala T'rp 
G5 50 

11 

<211> 6HFJ 

F-shjd 7 



bnsi>: jod • w- ; oi.4C4/.»a?ti 



WO LI 1/36473 



pcr/usuottissi 



<2 1 2> ON A 
<2 1 3> [i . S 


ip-i t; jj £ 












■^DO:> 11 
ctqctc^ttq 


tggcctttgt 


getgggegea 


c-t a gqcaat.y 


ygC;Ks-: greet 


qrijtragtttc 


6 0 


tgcttccaca 


tgasgaectq 




aetgtttaco 


LLr.t<: aa-r.Lt 


ggi^tjtg^ct 


120 


ijatt test co 


1 1 at gat ctg 


uctgcctttt 


cqgacagact 


ittacrtCdg 


£C"y tag 


13 0 


LCpgtjcrl: tt*g 






ggyotcttcus 


cqt tggecat 


gaacagggee 


a o 


gggagcatcg 


tqttccttac 


gqtq^tqgi^t 


qcggacaggt 


;i 1 1 ■ tcdti^y t 


ngt COjCCCC 


3 00 


7-acon cqegg 


t gaacactat 


Dt cca cccg g 


gtggcggctg 


iica teg tct <j 




J CO 


■ ji^o-;; 1 1> or- 




uqtgtntcti: 


t t-qctgqaga 


accatctctq 


cqt gcaagrif. 


1 2 0 


is L L,g 4; Ont O t 








a tqgctqgca 


tgacatCdtc, 


JHD 


tt c.-v=icj^t*ji:- 






At^tcttat 


tttgetcett 


caogat tgt t 




tyi^grutqii 




qcagctLirt-.:";^ 


^^.•ii.:^i:-r^ Ot-C 


gn<?t Q r y jg.^.ot 


ggc-q&cccgc. 


ego 


tt citcatgCj 


t c,gt i^gcaat 


t gtgtteatc 


acit get a czlz 


t gc-:i:^-3CCjii 


H tetqet .aga 


bbO 


ctctatttcc 


tctgg«scggt 


qocctcgagt 


gect gegste 


crtctgtcfiA 




7 2i> 




tcagct tccc 


c-t Du^tgaac 


agc<jtgctqg 


atccx€tggt 


gtitta t lit f. 


7^r> 


r.coiiyC i:;Oc;t 


cc 1 1, t ooyo^i 


1;. t nt^J on ;.i c 


^□c?c.t caasa 


t ot g-c-agtct 


qdaacccaa^ 


H J n 








<.19!U3ogatgo 


caattt eg 







<210> 12 

-211 > 2^6 

:212> PPT* 

"2 1^> H.Eapi^rj^ 

<4 0 u > 1 2 

ijeu Leo lie Val Ala Pha Val L^u Gly A 1a I.caj Sly Gly V*l Ala 

1 & 10 15 

Leu Oys Gly Phe Cy£- i J h& i-iia Met Lys Thr Tip iyA Pro 5er Thr v.al 
2 0 2 5- 3 1 1 

Tyr Leu Phe Ajsji Lou Ala Val Al<* Asp Phe Leu Leu Met IU Cye Lfiu 

Pid Phe Atq Thi Asp Tyr Tyr Lftu Arc Arii Aip H.i? Trp Ala Pb*- tly 
5C ~ 5 5 " 60 

Aap Ue Pro L'ya Arq V^l Gly Leu l J be Thr Le-u Ala Met JLen Ax-g Ala 
f>5 70 7^ ao 

Gly Set lie Vol Pho Leu Thr V^l Val Ala Ala Asp Arg Tyr rhe Ly e 

30 95 

P^ge B 



WO U 1/36473 PC1V USUU/J158I 



V:il Val Ei_3 Pre Wis Hi* AT a Vnl A^n T*ir 11* bei Thr Arc; Val Ala 
100 105 110 

Al -3 Ply 31<> Vtil Oy^ TKf Leu Tip Ala Leu VftJ. 11 Le'j GJy T"h.r Val 
1 1 ^ 1£0 1 2*? 

Tyr L*u Glu Aen His L*u Cys Val Gin Glu Thr Ala Val Snr 

120 1JIJ 

Cya Glu Sex Phe lie- Met Glu £er Ala Asn Gly Trp His Asp lis M&t 
145 15-0 155 160 

Phfl Cln Lea Glu Ph* PJic Met Pro L*u Gly 31* He Leu Phe Cyz Ser 
1&5- "PO 175 

Pb* L-y* lie val Tip ser Leu Ary Arq Arq Gin Gin Leu aj.h Afu, Gin 

1 B i> iss i&& 

Ala Ax 3 Met Lya Lya ALa Thr Arq- Phe lie Met Val Val Ala He Val 

195 20 6 2 0E- 

Phr, He T'br Hys Tyr T.su PjtO Val Sfci" Al-^ Am Lxju Tyr Phe Leu 

v\n 21& 

T/p Thr V,U Pro Ser Sei Ala eye Aep Pto Ss-r Val His GJy Mm T>.:ij : 
2 2^- 2 <L. 2.4 1> 

his lie Thr Leu Ser Phe Thr Tyr Met Aan tier Met Leu Asp Fr-o Lt ll 
245 250 

Val Tyr Tyr Phe Swi Ser Pre- Ser Phe Pro Lys Phe Tyr Asn Lya Leu 

2 GO 2G5 27 0 

Lys 11«j Cys Sor Lou Lys Pro Lys Gin Pro Gly Wis Ser Lys Thr Gin 

275 n sgs 

Arq Pro Glu Glu Met Pra He Sex 

<210> 13 

v211> &10 

<212> DNA 

== 2 1 3 > K.Sapieus 

<40O> 13 







t,<- tc : cjt gafto 


tjLQ'At.-y u-li^y' 




ttrrtgtgcta 


GO 


ttgMCttcc 




ct actcacta 


gstgacaggt 


ggecett c-gg 


ggagc: tyCt'.: 


1211 


t gC'7iac?c-t r-jq 




gttr^tatatc 




qenrje^t cct 


gctgctgacc 


180 


Uj cefcrt f:l...j 


L-ig l : ,af: i":ayt i. 


T. a yg V LiT.g 


L.qi':i:rti":C!'j.HC. 


tgt.gL U:. T r.:L. 


y rcc-tacr-gq 


24 0 




st Gjcct gget 


gggcaccagc 


accacctggg 


cccrtggt ggt. 


cc\. c ca no 1 


301] 




tggccttctc 


ccacdCijqsc 


tacat caat^ 


qccaqstgat 


Ct gg r S. LqAf: 


<t,i} 


at QdCCdCicc 


aagagastt t 


tgateggett 


tttgcctacg 


gcat aqttct 


gacattgtct 


4 2H 



Page 9 



BNSfj'. on ..-w; 



I'M J64 / jAL-'l I 



WO PCT/USUIftil5«J 

ggctttcttt ccctcctt gg tcattttggt gt get at tee ctgatggtca gga eject gat -IBO 
Cddqiccaqan gaqaacctca t Havjgnc-arrg b'10 

'211> L7C 
<^:i-^> PPsT 

< J u tt> t A 

"Tip Ber Cys Ala Thx Thx Tyr L«u Val Aan Leu Me t Val Ala Afp Leu 
1 5 LO 15 

Leu Tyr Val Leu L^u Pro Phc Lou. 3 In: rie T'hr Tyr S^r Leu Asp Asp 

20 JO 

Ai-cj Trp Pro Phe Gly Gin Leu L*?u Cys Ly* Leu V.sl Hj.-j PrK? L*n Phi? 

Tyr lie Aan Leu Tyr Gly £©x lie Leu Leu Leu Thr Cye lie Dex v^l 

His Gin Phe Leu 01 y Val Cys His Pro Lgu Cys r Leu Pro Tyr Arg 

Thr ^rt> nis AJs Trp Leu Gly Thx Sex Tnr Thr Ti'p Al« T.mj Val 

EiL. 20 

Val Lisa Gin Leu ijeu Pro Thr L^u Ala Fhe ser Kis Thr Asp Tyr lie 
100 105 110 

Abd Gly t»ln Me t lie T'rp Tyr Asp Wet Thr Sex Gin Ola A en t^he Asp 
115- 120 125 

Arc- Leu PhE Ala Tyr Gly lie Val Leu Thx Leu Sex Gly Fhe Leu Sex 
"130 * 135 140 

L*i.> T.^u pj v "is Ph& Gly Val Leu Phe Thr Asp Gly Gin Glu Pro A^p 
1JE " 1 E]0 155 160 

Gin Ma Aia Gly Glu Pro His ulu Asp Arq 

no 



210? 


15 


211? 




> : . "i 




;>i:^ 




220> 




2:i> 


tni^c: feature: 


2 2 ?: 


L 4 31 ) . . (461) 


2 £ 


rt is .any nucleotide 


■J l>0:- 


1 5 



t-aqe 10 



WO 11 1/36473 



PClYUSCWttlSSI 



ccttv^gctgcc t F^cOtTO-^agt equo^usi^t,;* q^qt^^'ictcg cancccscga aca-gcagccc 120 

DaPvJgctgg ctcoitut tCo og^totclCftt 1 Ujcj.j^cigg qgcatng~gc tntj-qcqcacq ISO 

gnEOtOcac-ct ggg^te-cjcug pcC^Q't^f: y C ^ t:- >; iq ct ggggccttca gccggtqccg 2d0 

rtflciiiicKiCu q^tiflci t^y ci-ijccacaziqc gseac^oat 9 atcttaaoa^ ycgocj^cgaa 3 DO 

gcccqcq/anq gcctcata gs acycgtacac ctqcaegtgc o^ncqct<]L^. qc^jr^c^a* 3 60 

qatccagtgg cagcgacgce tcccc.gg.cca ggctcggqcg ga^agtggcy crgcctc-grt.^ 4 20 

oarjniqacgt t rmnn Finn ruin nnnrinnnnnn nrjnnnnrjrjnn nagt act age gcaccac-aaa ^ K> r> 

COc^'^cuyt cqci^'Oaqtr iieagt^i::*:^ n^.:]Gr:Ltqcoc: j ggcggcg a qggcacgcgc 5-iLJ 

^qL:s^c:i.!iju t:^ y c ^ y L ^9 frC; y y Ms^-g lj.^c [-.c-c^Oiit-iAjy t> j^t ngangg cgatgagcac 600 

cacgaggtgg cicci^aggcgc rccgccc-gga tocctycsgc Jigc-t gcflL-q« .■a<ji:vv:j r:r>.^^e *?60 

caggtceccc gtggcc^cac ggggctccicc cagc-dgttcc ca-ggccagct: qty^cag.igL: "?>n 

cgtncccccq cacnqTtaw qqt cc^c-r^^ gtccc-sgct gc- o.e-cagcanga agtccatctt 

rjcy LLcgrUiJi Einnrjtjnfjimrj miimmmrinri rinjjjinntjrtac a^Lryy i-mch unfiol. "jtgq't. EM n 

gttgccttfcc acc^ccacca ccsggat gac- ccccaggsac accagqc-iiga cgc* H^M 

<210> 1 6 
<211> i96 
^212> I'KT 
<2i?- : K. Sapiens 

<22 0:- 

-271- Vl-iSVF'Z 
- 1; - : : i5 ) , ■ 

<'22'<> Xi£ is unknown 



<2 2 3:-- Xa-a is Unknown 

Ai-'Q- Vdl Axg Leu Val Fhe beu CXy Vai lie- Leu vsL Vai Ala val Als 
1 5 Lu 15. 

GLy A3 a Thr Thr Vn L L*u Cy& Ary 1-f-M Xaa X^a Xap Xan >'n.> Xa-a 

&z\v SSija tin™ Ly."5 Ar<a Axg Ly£ Met Asp F-ht: Lreu L^u val Gin Jjeu Ala 

Leu Ala Aap ieu Tyx Ala Cys £ly Gly Thr Ala L*u 3er Gin Leu Ala 

Pago II 



BNSLK.,CI[; --Wl. ; 01^ J 47.^A2T ! 



WO 01/36473 PCr/USCI0V31S»l 

SO 55 61] 

Trp Glu Leu L*u Gly Glu Pre Arg Ala Ala Thr Gly Aep Leu Ala cys 
65 70 75 ao 

Arq E'b* L*u Gin Leu Leu Gin Ala Ser Gly Arg Gly Ala Strr Ala His 
35 50 35 

T,ru Vvl Val Ley lie- Aid Ley Glu Arq Arcr Ax g Ala Val Arg leu Pre 
100 U>5 no 

His Gly futf Pto Leu Fro Ala Arq Ala L*u ALh A La Leu Cly Trp Leu 

US 120 125 

Leu Ala Leu Leu Leu Ala Arg £ly Ser Gly Fbe V.=sL Val 7W-ft T'yr Xac 
130 13f> 

Xaa ;:aa Xaa Maa Xaa Xaa i!aa J'aa Xaa Itsa Tbx Sex Leu Gin Pro Gly 
14 5 150 155 1^0 

Ala Pn> Te^u Sur AlA Arq TMa Tip Pro Gly Hut Arg Arg Cys His i'rp 

lit: Pht: Ala Leu Leu Gin Axg Trp hlfl val Gin Val Tyx- Ala Fhr? Tyr 
L£0 1ED ISO 

Glu Ala Val Ala Cly Fbe Val Ala Ero Vsl Lys II a Met Gly Val Ala 
195 i:D0 205 

f v s GJy Kis Ti^u T.e-u r?ev v.*l Ttp Tip Aix] Hi.-?; Arcj Lbu Lys Ala Fro 

Ala Glv Ala Ala Ala Trp E?ei Ala Str Fro Gly Gly Ala Arq Ala Px-o 
225 230 ^d0 

Set Ala Met Pxn Arn Ala Lys CLn Sex Leu Ly£ Wet Sex Gin Leu 

2 45 25 0 2 55 

L^n Glv Leu L.ui Phe Vy J Gly Cys Glu Uuj Pry Phfc Ala Asp Axg Leu 

Glu 711^ &lii Tti> fler T>ttr: ,sl v PVo AIh Gly Gly Trp Glu "Sly Glu Ala 
:lt$ .'(3 0 ^H5 

Leu g«r Alii cys cys Als Trp Txp 

<21D> 1"? 

<:2±1> 601 

<212> DMA 

^ j 3 > R,S ^ pi.*? r 1 S 



l, .::l,a^.3t-ttt tctctgaact ttgagcctgt gaaaaaagaa gggatgotgc ctCdggcc^c 
i:<r.rz$^r.crtzq atdCtcactc tgagtgccat gaggtagtag aggacactga tgacagtcat 12 0 
ggggaggagg tagaatagga euiyaggt gtfC Ctn^U^ t<] at^U.^i- t.:,:;^ t0\^ "I HO 
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YVO 111/36473 



PCT/USCJl*y315«J 





Lit. id CfiQr'A V CQ! 


r r:p£s3 ectgs 




O-aqgnac 


ecc.t tq Tcrr 1 ^ 


;.i.yt .3gtgqaa 


24 U 




t^tf* tget gg 


Ly t tgyy CSs] 


Liga i] n q g 






rj:L:-^ga.cga t 


3 OU 


gecgay y« t_o 


tt-CjiSgCjigL-'CL: 


qgcgorgcigt 


CiCt 




tt I JC" 


•-q ni'-ci,?: 


5".-^jgtgtag 


3 tU 




taifcgctL ca 


cget qacgqt 


£gt Q'itgLit.^ 




1. 'jy^yq 




4 20 


gg tctcaaag 


agggcegt ct 


tgaagt agci 


gcc-cacgggc 


ccga 


,=5-1 a a.Cic; 


aAg^yt^gt h 


4B0 




etc 


teat -5ig.se ct 


CQ3C;gggr7i5:t 


t cc-aaggagc 


agga 




ggt ca yagac 


S^O 




EL^g-irq^ t -sgt- 


rigt t t^q trrr-q 




cttcata 


gect ggt get 


q caqastca c 


SliO 






iir>jy CH 1: 1-4 T: 


■ jrtj-j ^ tjrt i": i^r> . 




Cyd.t^.;-s^ i-j..-:.:: 


fi.t-t- grji>3 CP.t 


3 naeeacaga 


G t-0 


cacgggrjagLi' 


^Aijhii^ tgg ^ 


tqcyc g 6 L- g 


t t-r 


qL:agrtiiy 






a c; tec* egg t 


720 


.jctgttL- 




tgtttctgga 


•at qgatctt c 


taqtttctgc 


tgg tags tec 




700 


ct gaagt tt t 


tec at c cctg 
















^ D l. 


■' 1 n :> 

< M 2> 


PP. 7' 


















<4 0u> 


IB 




















3© r L'ly 
1 


Wet 


Glu Lys Leu Gin Asn 


Ala 


Sex Tip 
ID 


lie 


Tyr Gin 


Gln 
1^ 






Ltu G 1.1.1. 




Pr-o PKk Girl LyS "H i 
20 


L-ru 
2 c . 


Kit* 


Tbr 


Glu Glu 
-; r; 


Tyr 


Leu: 






35 


eye Giy fi'J Arg 
4 0 


3er 


H±a phe 




LtrU PtCi 


Val 


5 ei 




VaL V-sL 


Tyr 


Va.1 i ro Ii«=< Pr.e Val 


Val 


Lily Val 


He 


Gly Asn 


Val 


Ll:Li 




Val Cya 


Leu 


Val lie Ley Gin 13 it- 
7 0 


Gin 


Aln t+^t 


Lys 


Thr Pro 




Thr 

GO 




Tyr Tyr 


T^u 


PlTitr ^t*! Li£.Lj A1m Vtil 


^ex- 




Leu 


Val IftLj 


L^L] 


T .^u 




Ely Met 


FxC 


Leu Glu V 

1 0 0 


al Tyr Glu 


Met 
1 0 5 


Trp Arg 


Ash 


Tyr L'rc 
110 


CKe 


LeU 




I'he Cly 


Pre 
L15 


Val Giy C 


Tyr Phc 
120 


Lys 


Thr Ala 


Ley 


Pbc Glu 

li:5 


Thr 


Val 




Cys PhD 
130 




Scr Ir.] TlK 
1 j.S 


TJ" ( r 


TJti V* 1 


5ttr 
14 0 


L Gl 1j 




Ty-r:- 




i o a 


lie 


L^li >Ji£ Pro Phie- AlCCf 

150 


Al a 


Lvs Leu 


Gin 

13 


Sei Thr 


Arg 


Arg 

160 





BNSCn x.:i[j - WO 



01 M^/.iAi 11 



Arts Ala Leu flrg Tie Leu Glv 1 1* Val Txp Gly PKft Set Val Lfiu PTi* 

L6& " 170 1?5 

ier Lea pid ftsr. Thr fler 11b His Gly He Lys Phe His Tyr Phe Pre 

130 1^5 Lhi> 

&3n '-;lY Sar ley Val Pre Gly Svv MA Tlir Cys Thr Val He Lya Pjco 

195 ^PO 205 

t-j^r Tr f > Tyr Asn Phe He lit Gin Val Thi S^r Phe L-ij Flit:- Tyr 

" 2I0 21G 220 

Leu Levi Pr-o Met Th/ Val He £ex Val Leu Tyr Tyr L*u M*t M* Lou 

22b 22i0 235 0 



Arg val Lex 


lie Ala Gly Val Ala ■'■ 








24b 








<21Q- 13 










<£11> 222 




























<JQ0> 15 
atcaagatga 


tttttgetat cgtgcaaatt 


attggatttt 


cc-aact ccat et gtaatceiz 


(SO 


st/tgtDtatg 


catt tatgaa t-gaaaact tc= 


aaa a a aa a t q 


ttttgtctgc agtttgttat 


12 0 


t -gcatagt 




r-aaiini^n t g 


r^aa uttering nat t acaatc: 


ISO 




sa.^aa^qtt ttenctcaga 






222 


<210> 20 










<211> 73 










<212-y f-RT 










<213> H , Sapiens 








<4D0> I'D 










11 r I.vS Htt*- l' J * Ph* Va l 
1 & 


Gin ll* He 
10 


GlV Phfi Rftr AK11 ^r=r 
IS 




T I * rvs Aarj Pio He Val Tyr Ala 
2U 


^he Met Asj-j 


Glu Aejx Phe Lys Lys 
30 




Aeti Val lieu 3er Ala Val Cys Tyr 

3 5 4 0 


Cya lie Val 


Asn Lys Thx Phe Ser 
J 5 





Pro Ala Gin Arc, Hie Gly Asu ^ He ™* M*t MeL Arg Ly* Ly.^ 

5 0 ^ tC 



Ala Ly? Fh* Leu Arg Glu Aan Pro 



65 



7 0 



-ilO> 21 
<211> 1"i7 
< 2 1 2 > L>HA 

Pace 1J 



WO LI 1/3*473 PCX/USUU/J1SSI 

< 4 00> 5 1 



g<2'C ciC ciCC-d t 


qciqttttct 


q t ac] a a 1 1 c c 


act ttcitctt 


t «g C_ y r — : L t Ljii^i 




6 0 


L J 1 j L lJ U 


CDaQgatCa c 


ca C'jt a era at 


a qq a accq t g 


aggt 3ca tgt 






r>:«t ng cac-t c 


acaaatttgc 


■agn.-sgqigcag 


n;^ a acat c 


c-a a gtct: t ct 


t ga t '3 a ggt a. 


1 ri 1/ 












agt t s-a t gac 


2'1 0 


ego C3tt> Lilly 


^Lca.-;: tg.-bCC: 


gggtattoat 






tngaaatgac 


jOu 




CC gC''Jrtdtd d 


gcactatgaa 








3CD 


a ggat c yea a 


gaggaa tt cc 


tggaggtstt 


gtqgocaqgi^ 


at£ ct tgggd. 




4 2 0 




gr-accagaqt 


a* ct q □ c 











<' 1. 0:- 2;> 
<:;].^> PRT 

<-aorj> 22 

Val Sex Tyi i'er Gly Ala t>h»a £ex i J io p\rc- Gly Asp F'ht; 9na Pkj- M*f. 
i 5 ID 1£ 

Pro Cly Kia Asn Thr Ser Am Atn S-?r Cys A^p Pro H© Val Thr 

2D ^ 30 

Pir-j Ri£ Leu lU Str Lrf?Lj Tyr Pbe lit Val Ltiu II* Gly Gly Lft ^ 
3E- 40 

Gly Vail lie 3er He Leu Ph* Le*i Leu V*l Lys Met Aan Thr Arg Set 

50 55- 50 

Val Tbj: Thr Met Ali\ V a 3 Or A^n Lou Vo>3. v«i Yril Wis. Ser Val Fhe 
5 5* ~'CI 00 

Lr;:j T.r^u Thi Val Pro E-ht Arc; Lfeu Thr Ty* Lei) 3i« Ly* Lys; Thr Tj.-p 

Htrt Fh* Gly L*u Pro The Cy* Lys Phe Val 3©jt Ala Met Leu His lit 
10 0 105 110 

hie Met Tyr Leu Thr Val Pro Ho Leu Cyc Gly Asp Pro Gly His Gin 
11S T : >v - 

lie Pro Hi? T>jn T^-i- 1 Glr, Val Gin Ai-g Gin Eer Gly Ha t^'-i GLn Ly* 
131.1 j. 1^0 

Thr Ala Cye Cy* Gly 

:s4. c i 

<21U> 23 
<:211> 222 

PAg* IS 



WO U 1/3* 4 1 3 



<212> DNA 

•afrtq-acc-aaci y t cai^gqcat cqictqa^gc tagaaqgccs caqqaaatqc cui^tcaag^t tO 

qttqgcqect gcaatcqcaq ctacc^ccvon ut t ^acc^^a gc!C*ngn«gqg c-nrjqrcc-gc-- 1?0 

o-gcf].aac-3cg gtc arcoigc.? t>«?.S"jt(":L';at.t qc aciisgeacq yiy.^^i^ti^Cii CQAt.^LiCLrfra IfcKJ 

c^iCLj^ec^r't] cyqat tjootc eg-- 1 ttcaaa q-agqtactcs ca 222 



-: 21 L 
■: 212 

•: 213 

■:4 00^ 



'in 



Cvi Clu Tyx L*u Phti G I SO*- Tip nly lie 7-irq Letkj Ala J Tip Me 
.S 10 IS 

Ilr Val L^u T.«u r Val L^u Cys Asn Gly Leu Val j_i&u Ltu Thar Val 
:><] 25 30 

File Al.<s Rly Gly pro Ma t : r.o ieu Pre I'ro Val Lyt E:he Val Val Gly 
1*5 4 0 4f> 

Ala lie Ala Gly Aln Ann T>ir L-Cu T>it Glv r I.l^ ^Cr Cys Gly TU*u T.^u 
A-L? ocir Vol ft.-ip Ale L-en Thr l^u Vil i-^x 

■:211> 

<212* DMA 
21 3> JH - sapieriE 

^400-- 2f 

CiCJCCr:r^tr^ t: O I .^Hr^O^ I- i~: :-i£,CHQC qa i^Ctq CCjCC ti CDC^ ct cc t q c c; c T. ci £j Lc v>0 

! ^kjt^ji^q^ac': gc^actcc-tg cqqcaqagao ccgaqtgqct cccagoagtc ggcga^cgcg 120 

ectqayiqett ccgggggcct gcgccgctgc ctgcccccgg q cc 1 1 gat -rig gac^cttcacjo 180 

^gctccjqagc get *:atcgcc ccagcg-^gwc i^-^get qgncti ^cd.gr-gr^t ^ cac^w^qc- 2*30 
cccggt 

<210> 

<211:> til: 

<2~\2> PUT 

<213> H. Sapiens 

■: duO > 26 

Paqe 1 b 



wo m/M-ra ptr/uStJoyjiNtJ 



Aen Pro lie lie Tyr Thr Leu Thr A^n Arg Aeo Leu Axg Kis Ala L*u 
1 b 10 15 

Leu Arq Leu Vai Cys Cys Gly Axg His Ser Cys Gly fljrg Asp Pro Ser 
2D 2 5 3G 

Gly Scr Gin Gin our AAej S*t: ALs* Ala S.i.'u Ala Sex Gly Gly Lt-.u Arg 

J l J A 5 

Arg Cy.s Leu Pro Pro Gly l*u AEp Gly Sex Phe Sex Gly Ser Glu Axg 

50 5b fcO 

&er Ser Pro Gin Arg Asp Gly L*u Asp Thx Sex Gly Sei Thr Gly S^r 
05 "7 0 "7 5 BD 

Fro Gly 



<211> d20 

■c212> DMA 

•: ?13> W - Sapiens 

<22li> 

■"22 i > mi£C_feature 

<222> f SI j . , \ 106> 

^'£2 3> n 13 sii v nucleic send 



= A \i 0 > 2 
L'Clt ■ $ "j'^ ijl 


;-i 1} g L'^fi^Cb. 


r ga -c cage a t 


gtgcaccdcg 


coeqet-e t gc 


gecrgegat.^c 


fin 


tcgcrgggtcc 


qci-gcct cct 


nnnnnnnnnn 


nnrmnnnnnm 


nnnrinnt ggc 


aqagctt gcq 


120 


cgcgatgcgg 


gcqtacatga 


cca-cgatqag 


c-qcc^gogcvr- 


gcc-qggt aga 


t gt gcg-a gasi 


1G0 




•gtqtaqaccc 










240 






■ggc-OQ C-t-etiC 




tX!^LL:Ctf:^<-: 


c : g^t c-rtcijijlr. 


3 CO 


eiqtg tgaczy 


gccgagggac 


acatgatgag 


c-B-gcgcc a 


geccagatga 


cqgc^etggt 


3 6u 


gacgagcgcc 


t t ccgcagg^ 


tcsqcttctc 




t pejega t go 




i 20 



*'2L0> 29 
-21H> 139 
Z 1 L- > PRT 



<2£1> UNSURE 

<:222> [10-1 . . (1131 
<223> Xas is Unknown 



1 0 0> 2 8 

Phe Arg Cyy Eld V*> I H j ft Pro Fbe Arg Glu Lye Leu Thr Leu Arq Lys 

l'aqe 17 



WO IU/36473 PCX/USUO/315*! 

! 5 10 i - 

Ala Lp-lj val Thr lie Ala Val lie Txp Ala Leu Ala L*m lie tfet 

20 25 

r va Pr- Ser Ala V*l Tli i Lfeij Thr V*l Thr Arq Glu GJ.n Jlls Jii* Phe 
M- -t- V/^i A'it- AlS Arg Afin Aiq Ser Tyr Frc- Leu Tyr Ser Cys Txp G-iu 

Sr, !>!> tO 

Trv pk. Glu Lys Giy Met Arg Arcr Val Tyr Thr Thr Val Leu Lhe 

Hi a He Tyr Leu Aia Pro Leu ALn Lev II? V. a 7 V*l Met T-yr Ala 
9 5 M ^ 

Arg He Aln Arn Ly^ L^l. cys Xsa Xaa i^a Xsa Xsa Xaa > aa xaa Xaa 
2 DO L05 ilO 

yaa tin Ala Ala Aep Pro Arg Ala Ser Arg Arg Arg Ala Arg Val Vnl 
lir. 120 125 

KiE Met- L*m Val Met Val Ala Lit la Ph* Pb^ Thr 
130 135 

^21C-> 29 
-211; 316 
=:212:= DMA 

cjoatggggcg tqagtcctca ggcacttctt gaggtcottg ttgagcaggn. a^uaqacaat 
nqggttgacg geagectgg-g cg-a^ct c;?t. r^a».^3^ ^t^'K-i^q-jt- ^i^<-jt^j<j 



^ I.. Li;,*-": f; £i "I r I.1 



c-acagcacsg gctttca^aa ao^U-gno* <ii.^;.^cc "--Jf.*.^ 

gagcagnaag 5qc?qtt']Ua Lc-g^tJJ&Aa c& tQcqqccf: agct grtttt cacccttg^t 

^ti^ntC-L^L-q cr--i.agr.ag re gccggct^gc tgcatgccca ttctgccgqa tdecca^g 

ygtr.ggr.ggc stgggccc 

<2l0> 30 
<211> IOC 
•":2"!':> PRT 

<4 00> 30 

Civ Mp:t Pre Piq Thr Leu Uu, Glv A c 'l Glri Asm fi.ly Ni<: Ma 

i " 5 ill 1S 

Ala ipr At-h A^ T ^' J Xf? ' 11 nx V M*r_ Asp Glu Val Lys Gly «lu Lye Gin 

" : >/j 25 30 

i.ov '"iy Met T y- r Ala Ile rl ~ ir • Leu L " ,jl LtlEl Leu L,E,U Trp 

Page IE: 



GO 

1/ :U 



.U 13 

HOD 
31B 



wo u 1/36+73 pcr/uscwmssi 

5er Pro TVr Il-s Vol Ala Cy* Tyr Trp Arg val Phe val Lys Ala Cys 
Ala Vf*l Pin Hi* Ar-a Tyx Leu Ala "J'hjf Ala Val Trp Met Sex I'he Ala 



Gin Ala Ma Val RBin Fro lie Vsl Oys Phe Leu Leu Aeri Lys Asp Leu 

a & 90 

Lys Lys Cys lea Arg Thx El is Ala Pro Cys 
100 1.0 5 

-:210v 3L 
<21 1> 35* 
<2l2> DNA 
<213> ti . L- apiejia 

<-100> 3L 

tattctgtaa tg«.agcwfc rit <--v\- Uca o^ct: g^cfittggcra catceagtc;q tttcfeicct 

catt^tgaaa guccV Lc«ji;il- tgcitg t attg ccacttcstt tta^ag^t gcscaagtcc 120 

otcjgtgcutt trx:tf u^caa tgcaggtcat agtgaegstt tctgtcacaa cagcggtao;a 1B0 

nttjq^'^iirt^l gg.-:riCCdtct tijc:aaatqaa agcacctgca ci taaggaaat agciataaatc 240 

a La:atcaaa acasasagaa; taa-aggtttc otctqtgtct. t irr?t as t t^t cvictcvtcurjt 



3 0 0 



ccsttctga; cctctgccaa aaotgtt-t^t- c?aU.Q U*rtt:t £ o l: <- r. q t.z-q& ct^ca ^5 a 

^210> 32 

2 1 1 > in 
■212> PRT 
< Z \ 3'* H . ■ "^J Li; 

Val Tyt Arq val Lie Tht Tl* lie Us Leu Phr Sly Arrj fll.y S-^r Gl.i.> 
1 " 5 ID ^ 

Trp Thr A&p Scr /Sf.p JtRn Tyr Lya Asp Thr Asp Clu rhr t/he rL& Leo. 
v;G 2 5 3 0 

Phe Vrtl Leu Met Tyr Asp Leu Ser Tyr Phe Leu 'Thr Ala Gly Alu Phe 
35 -4 0 4& 

lie Cys Lys Met Val Pro Phe Val Gin S^.r Thr: Ala Val Val Thr Giu 

lie Leu Thx t£t?t Thr- Cvs lie Ala Val Glu Arg tils Cln Gly Leu Val 
£5 1'0 76 GO 

Bis Pi-o Phi^ Lys Met Lys 'Xrp Gin Tyr Thr A^ji Arq Arq Ala Phu Thr 

wet Leu Gly Glu Ala Thr Gly Cy* * * 33f1 Gl V Se3C VsL Aen Aa P iIe 

Paqe 1 'j 



WO U 1/36473 



PClYUSiMttlSSJ 



10D 1^5 HO 

Leu His Tyr Arq He 
115 

<21Q> 33 
<211> 621 
<212; DNA 
<213> K . Sapiens 

aqtytt'^l,^: atQMtottgc tcatggcgat gcactcgacg atgtagaagg caqt^aggta HO 
qir^cttctcc ttc-acaaaca cggtggggaa gnEigt ccjogc ac^atggtga aq^-crt^qaa 
gg^cq-cccag catagcacgt aggcgii t<aag q^tQwra^n. ayv&ccxigyv uOqt rttcct 
■gocigcagcgc; ,^0; tc:U,y e .^qAt-cr.^Lc t^j tctgoaa t ccagggaccg tcLtcAa.^ 
.^c,Ol.L>;i:qg' g a cia t cct g g cataqcacag ggtcatggtg accscggggc ccacgaattc 36 0 
tdtgcrcsasq ataaaqagga agtaggactt gtagtagagc tgctggtcca csggccagat 
ctzggccgcag -sagatctttt cctqgnt ot t g^ueatrfli^ agc^COirtct or^fcnrjtiiaa 
gtag«jcggaa ggg-atrj^* t^g^a Lg^a ^f^tccac accaay^caa tc^ygccayL .^OD 
ggut<ltttg*3 oa^L.t-.cat-tc gtggtctcag cggatcfgaca atagcsagat. accta^qgca 
aga^tfi^afcg tg-gaggcagc c 

/211-- 2 07 
<212> PRT 
2 1 3 •> H . feapi en& 

"ly Cy.J Leu Ris L^u CVS r '* r Cys Pre Arg Tyr Leu Ala He Val Hi* 
5 S 10 15 

Pro li^u Arq Pro Arg Met Lya Cya Gin Thr Ala Thr Gly Leu He Ala 
" 20 25 30 

Val Trp Thr Val S*r lie- Leu 11* Ala 11^ Pro Sur P-I.a Tyr Plus 
33- J[i 

Thr Thr ni-j Thr: v«l Lfiu Val lit Val Lye Ser Gin Glu Lya He Phe 

Cvs Gl" <Hjj He: Trp Pro Val Asp Gin Gin Ipu Tyr Tyr Lyt S*r Tyr 
ti. " 7 0 7.S isn 

?he Leu Ph« 11^ Ph-r- fSly Glu Phe Val Gly Pre Val Val Thr M&t 

as &o y5 



1 BO 
£4 0 

3 no 



420 
4 B0 



600 
£21 



WO U 1/36473 



PCIYUSUU/JISSI 



The l.t.] Cy?, Tvr Ala Ara lie Ajg Glu L>--V: Tfp Phn 7,,ys Ala Val 

U>;i " 10 5 110 

Pzo Gly Fbt Gijj Thi: Glu Gin lie Ara LyB Ar*j L^u Aid Cys Ary a>ii> 
115 120 125 

L'/a T'hr Val Leu Veil Uii Met Cys Il<a Leu Thr Ala Tyr Val L*u Cys 
130 135 HO 

Trp Ala Fro Phe Tyjr Gly Ph-a Thr lie Val Arg Asp L^he Phe Pjto Thr 
150 155 160- 

V.i 1 PI in Val. LvS '"1" Lyft Hi.* Ty r Lev Thr JU Phe Tvr 11b Val Glu 
16S 175 

Cya lie Ala Wet Lex Asn l^ex Wet lie Asn Thr Leu Cyfi Phrt. v,*A Tlir 
lBCi 19 5 190 

Val Lys Asn Asp "Thr Val Lys Tyr Phe lys Lya He- Met Leu Leu 
195 200 20 5 

<^l0> .35 

v- >:"i 1 > J H /« 

<212> r.'H/L 

<213> [] . Sapiens 

r;-j[jO> 3 5 

caqcceoact qc^gtqatga a.jtcsa.atgt ccsacaccad ccataqtcac cattactaac 



taagc^gucii cs-^^yttcc ottneaggnt nt t c-nqtcagc jqiggncaggg cccagggcag 12 0 

i^L:<:Ai;:rtC;^ lg a f:Aqt.l:<jat:£i cq 1 1 t t L t cig i a e-o ij; v» qt y cc .a taggoc^cag 1BD 

gacdQ-acaqq caaractcag tart gat g^c dL'tcagcitg ctcagoccla r^tfrj.igenivjc; 2 40 

aaE.qgtratc acrgct ggt qa agaagctagg qaaEittgatg qaqatigaac ay &£ga £L- t L 30 0* 

actqaggtac accaggcaat ttataatctg gaagcagsgg aaqagc.aagt cggc^-rreii-qr 3f>l> 

g-^r-t 9 ngq ■> -sr?t n f>a cr^g a.^-a n gncg tt c-ct g eg cat g cggsacccca qgagecagag -ii2l> 

L:iar-.iri *<rjr.i:q U I" t cz\.^s cziift ij<:<: izq ft c.c. tit; t rj«a a -sg r^t cagga agocccigsat "J S 0 

<21D> 36 
<211> 161 
<21£> PFtT 
<2 1-3> H,Siipi^tj' 

Leu Lie Pro Val Phe L*u lit Leu Phe He Ala L-eu Val Gly L^ii VtO 

1 $ 10 15 

Gly Ash Hlv Phn V,-l Lp." Trp- L*3U L«u Glv Ph* Arc. Mc=t Arc Arg Aan 

Pigs r 2L 



BNSCXXJlD -W(; !^^47./A2T I > 
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pCT/USUOttlSSJ 



Ala Fh& Ser Val Tyr val Leu Ser Leu Ala Glv Ala Asp Die L*u Phe 

35 40 45 

Len rva Phe Gin He II* Asn Cys Leu Val Tyr Leu Sex ftsn Phe L'he 
50 €0 

Cys S"r II r lit A-*H Pli et Pre o*i Phi? Phe- Thr Sex Val Met Thr 

f.V ' 7 0 ^ BO 

Fh* Aio Tyr Lou Veil G.lv T -^ J 5*je- We L T --^ J S^x He Sex Thr aiu 

Cys Cyfl L*u 5kl V«al Leu Arq Pre lift Trp Tyr Cy.s CyS Cy* Cys Pro 

100 IDE* Altl 

Arg K&n Leu Sex Thr Val Met Cya Ala Leu Pro Trp Ala Leu T^u 

115 12 0 12 5 

T^u Lkv A%n T>ir Lrtu GJu '31. v Lys Pny Cys Cly Phe Leu Val Sex Aj?n 

l.TiO l->- HO 

Gly Asp Tyr Gly Trp nys Tip Thr Ph#» Asp Phe rLt? Thr Ala Val Trp 

145 ISO- lbs lt>0 



-:21vJ> J? 
-.211- > o 0 

4 00 > 3 7 

oagaqtctqa ttctgactta catcacatit gtaqgcctgg gcatttctat ttqcaqc-cX.q 60 
tttcctttgot tgtcc-gttga ggtoctagtc- tggagccaag tgacaaagac aqa^tcan": 12 0 

i so 

30 0 

33-D 



tatttaccicc atgtgtgcot tgttaacatt gcagccactt tgctgatggc agatgtgtgg 

i.l-.fmf t,y I y<] t:t.t i.<;l:. t. tct 1..-3* l.ij q*-:- r.ic^os ^ t.ttaos ciicc aco n ggg a t q tgtc:gcagcc 

icdtr.r.tr.tq ^r.irfltttctt tj-.accLtr.r:r. y i. aim;CI Let ny-jit gctt c^^cactc 

cttatcctct atggasccat gattgttttc 

■:2L0> 3S 
■2ll> 110 
.:2lZ> PPT 

<^0Q> 3& 

LUu Ser Leu lie Leu Thr Tyx lie Thr I'yi: vsl Gly Leu Gly lift- 
1 b 10 1* 

II* Cvs S«r Lcii II b Loo Cys Lr-u S&r Val Clu Val Leu Val Txp Ser 
2 0 a 5 30 

Page 22 



>VO U 1/3647 J Pt"lYUSa«yjlS«J 

Gin Vn.l Ttir Lyy Thr QIm 11c Thx Tyr Leu Axg Wis Val C/3 I Lb Vsl 

3 5- AO 4 5 

Aer. lie; Ala ala Thr- L^u Leu >S«L Ala Asp Val Trp PhE> II* "si Ala 

S 0 5S 6 n 

£tr C'he leu 3er Ely Pro lie Thr His His Ly* Gly HyS V^l Al« Ma 
h5 7 0 7 5 £0 

Thr Fhe L'he. Gly K±b Ph-a Phe Tyr Leu Ser Val Phe Phe Txp Tien Leu 
05 SO ^ 

Hb Lvs Ale L/r-i.. 1 l.eu 11c lw Tyr Gly lie Met lie- Val Phe 
LOO 105 110 

<210> 39 
<211> b2ti 

'212 ■* EN A 

;~213> K . Sapiens 

-"■4D0> 2 0 

ttgt^tggca gtagagag-st. gtcaggcttc agagtcaaca agaactgqat ttitft^init.^^ 60 

nttt^qg*c c-ccon.eottt ggt^agtgac tfcattatctg cgsgcctctg tttctctctr. 1>D 

nhh.t.itffltPfl y<T.-a-:" idtj^y^a tOCO^i t.?cqqi C p ij^gt t?gt 9 gesg-a gast ca gagatqatail' lf5L> 

dgct^cityat cacitctggt ttgtgttccc sggegcA^ca 3 a c:l\ jijrO<]L t tct^agcatg 240 

gatccaaccg trcc-agtctt eggtacsaaa ct gac-a ccj a t cascgcis-cg tgHyyaijaol:- 300 

^ett'-T^t-^Ut* *.t c^-g^^ct g^nct t u-av-g njtgctgacgt gcat~atttc ccttgtcgcia 2-hlJ 

r r.Lj!.^'.":?ji.j-:ja.^ ^<:i^f.^Q!f.*iifh. q l. I. <: l; i] tit I . i c L < j gt^c t erc-u g ca.tg cqtcaq ^oiacgctgt c «i20 

t £cst rt.Aiji t.cc:r.^Aa^ct g^o^caei^d grti:1. UCiM. oL t tint ...uatjett ccaqattata 430 

■sgttcgcc-at tscqcctcat caatatcag-: catctcatcc gcaaa^r.coL. i^t ttctq tg 540 

•sfcqiacctttc cc-t^etttac agejectgagt atqctqagcq oca tcagcac cciaqcg^r.cn: &00 
t c?t "t tiitt c trjt;n<] deceit- ctr-ntacc 



< 210> 


J u 


-:211 > 


2 D L. 




PRT 


<:213> 


H . ^: 


-■A 00-' 




Leu Cyr 


CI 



Leu Cv& Cly Ser Arg Glu Met Sir Gly Phe Arq val Asn Lys Jisn Trp 

1 5 10 IS 

rl* ser- A^rt Trp Tie: <3.L y Pro Pi m Leu VeiI Ser Asp Leu Leu Ser 

>0 i4 70 

Ala Ser Leu Cys L?he Ser Leu L-trU Met Arg Thr Visl Asn Pro Ii^ Arg 

4 0 " 'IS 

Page 22 
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PCT/USUUttlHJI 



filv illy f^ly Glu ftsn Gin Ajrq Tyr Scr Txp Ser His L-e'-t Val Cye 

val Pr.-> Arg Gly Thr Ar-g Leu Gly Fht Leu S*r K-^t Asp ■> Tlir Val 

Prr, Val Fbe Gly Thr L-ys Leu Thx Pro He Asn Gly Arg Glu Glu Tin r 
35 30 9* 

Pro Cys Tyr ftsn Gin Thr Lou Sex Ph* Thr Val Leu Thr Cya lit 31* 
100 105 110 

r^L Lr^-j V*J Glv Ln; Tin Gly A-Sn A.Lf Veil Val Leu Trp Leu Leu Gly 

T"r Arc Me-t Arc; Arg Aen Als Vil Cer He Tyr 1 1 Tten L^u Ala 

13.0 " 135 U(l 

Ala Ala Asp the Leu Phe Leu iier Ph© Gin Lie He Arq Sw-r Pro l.^i 
l^D 155 lf;<J 

Aj-q I/3i> He Asn Ilo Sex Kia II© Arg Lye He Leu Val feer Val 

165 170 "5 

M^r Thr PI if Pre Tyr Pbe Thr fily T.ew 5*5 r ttet- Lou Ecr Ala lie Ser 
ISO Lfi5 'J HO 

Thr Glu Arq Cya leu Ser Val Leu Trp rro rle Trp Tyr 
1*5 2 00 2 05 

<21C> 41 
<2H> 319 
<2i2> DNh 

<duu::' dl 

acagasages aggccsccag gsecttsggc atagtcatgg gacftgtV.Ugt Ottcrtg^tgg 

ctgoccttct tt^tcttgsc g-atcacagat cctttcatta attttacaac ..-.-L tc^rjat. 

r,"U]t: ncoiiUj tot ton Let q ri^tflggctnt ttca*ctctg ctttcaatcc est tt tat at 

^yr;^i..^,;:L:tl.. riLfiCl. L^q l: l\ t: rv: ? i e vjotr ttunggatcia ttgtcacagg- cstgatcttc 

i:ar:.;-;r:tg.4L.:l. fr.r.f:c&r;rc:t AAqf:r:trl;LJ.. tCtgC:cc:£il:<J ettaggctgt gttcatcatt 

caataggact cttttctgg 

-c2Ki = 4 2 
103 

<:.n.^> PPT 

< 2 1 3 :-- H . fi A p i * n >; 

Tioo> -J 2 

Thr Gi'J Ser Lys Ala Thr Arg Thr Leu Gly He Val. Met 1? L y V.i L Pho 
1 "5 10 1 Cj 

rac-e 21 



120 

300 
319 



WO U 1/36473 



V^L Leu Cy-3 Trp Leu Pre 
20 

lie fcsn E'h^T Thr T^r Leu 



Gly Tvi Phe Asri Sex- Ala 

Pro Txp Phe Arq Lys Ala 

His Pxci Asp Fci Thr 

Phe; II* rle Gift A.=;p ter 
100 

< 2 1 Li > 4 3 

<2 1i> 515 

<212> DMA 

<21 Ih> b!i , Sapiens 



taggaat ct o 


^g^g-r^ga-sri 






a£:aaga.>st gt 


a a at g^sa a£ 


60 




*t<:t tat U:a 


cjt-t atctn^t-^ 


««t.CiJ fry 




aon.t gaagac 


120 


dcifiisaat y ct 


tt ag^caic 


tgttgsatgt 


attgtcctsc 




.*i L. fj.'a I". r^;. : > L 'jC 


180 


tt gcctctct 


atgt cc-Eia^t 


gtttattttt 


gcagtt gacc 


ttaatttc-dd 


gtt^gttttq 


2 4 0 


aq gt ct ac 


agtaj.tgttt 


ttaatct gtc 


tctacttctt 


cagaaaataa 


attagt tgt t 


300 




t CiJ-t l..fl«IH»v" 


Oi.L^ct-^cl- 1; 




ttatt guctt 


cccaGa cc5 t 


360 




as<gcataoiit 


cAig gggt t.i 


it d5r:L gaat. 






420 


aaaat ct cat 


abdcatdB^q 


aggagt ta t a 


aaat teat at 


a a i^Cd t C33t 




A H 0 


aeqaggt stg 






get gc 






515, 



<2U» 4 4 

<-ilJ> I 45 

<2i^> PPT 

<2 L j-> ]] . Sapiens 

<4DE» -1 -i 

L*u Kis Gin Ai'g £lv )-iet Val Ala Lys Arg Gin Glu Mel: L-~u Mm Ai.a 
1 S 10 15 

Pho Lcli V.^L S^i- T.n> I.ru Pre Tyr Lt?u Vri.l At-p Ala Val I Le Asp Ala 

y.n >5 -3D 

Tyr M^t Asn Fhe II^ Thx Pre Pro Tyr Vai Tyr Tl'.- L^v ''Ol Trp 



PC'l'/USa0/315*l 



Ph* Phi? Val Leu Thr lie. Thr Asp Pro Lbe 

2 5 3 0 

fiin A.ftp Ltru Tvr Asn Vnl Ph^ LGu Txp Leu 

Fhe Asn Fro il* Leu Tyi' y M*t. h&u Tyr 
S5 bU 

Leu Axg t'j*t lie Val Thr Gly Met lie Fhe 

75 tu 

Leju Scr Ui.i Phc -3*1 ALn Kis Ala Ala Val 

90 95 

Phe 



BNPLX.x:i!) -.WO m J-u4S..iA2l 



WO 



PCT/USlMtiiSSJ 



Cys. Vsi Tyr Tyr Asn Ser Ala Mrt fisn Pro Lgu He Tyr Ala Ph* Phe 

SQ 60 

Tyr Txp PhO Sly Lys Ala. lit Lya L^u ri.o Val Snr "oi-v 1-Y'"- Vol 

6 5 7(> 7 5 ©0 

L^-ij Ar\] Thr A^;f> 5 e r Si±r Thr Thr A£ri Leu Phv S<-r Gli.i ^ l hj Val GLv: 

Thr Asp Lya His Tyr Cys Arcj Asp Leu Lys Thr Asn Leu Lys -Leu Arg 
LOO * 105 110 

Sor 7'hr Aln Lys Ilo Asn Thr Trp Thr Arg Gly Ly* Hitf A$ M His Met 
115 12 0 115 

Pro Stir eye Arg Thi Lie hit tier 'Jfh-L Val VaL Leu Lya His Leu Leu. 

inn 135 no 

tse i" £l er Cy s I L e 
1-15 

-210- 4 5 
-21 li 726 
-:212v DNA. 



it t goaa.Hgtitj 


utcctcqstc 


tatcctctsc 


gccqtcct tg 


gtttt^gggc 


tg t gctgg*ci 


hi] 


Q-cgt-t tqgaa 


act t act ggt 


catgat t cct 


atccttcact 


t eta a casct 


cicaca cacct 


120 


aca a a ctt t c 


t g a 1 1 g o g t o 


c:ct gc*cet q;t 


got §act tot 


t cjgtgc'gsig t 


c-actg tgat g 


130 


ccct t ca-q cj 






t^fctqgt.act 


1 1 qqrrno*c ug 


1 1 n.c-t gt^aci 


2 At) 


tt rcat^Ciit 


r. LL t KtJ.ir:MO 


St-OU l.;:t.gt. 


f.l;.LL.^LtL:t r 1. 


t;-iLLt i:.*t;. L t 


.•a L-. □;<-.: I; ij Lilt <: 


3C>0 


to tot i:yfit.a 




tqttact^t 


c: tctqa cct 


atccaaccaa 


gt t tactqt g 


3b"D 


t cagtt Lcag 


c;gatatqcat 


tgt trtttcc 


tggttctttt 


ct qtcacat n 


cacrc-tt tt eg 


420 


atcttttaca 








ti^t^utl: l>jO 


Lot .n.^ocL-iiL 




n.t i.1i;qft£.]tir.:t. 




.iCfyEi it 


a-scLgggt cc 


tactttqttr 


tCrt tCt At.tC 




I..L. t^L-^-: :i-:i::.=i 




i^qtgt ttati 


tacaqtaags 


tatttt tggt 


ggecaagcat 


fc-ij 0 


uaggc-tagga 


aga t agaaag 


t icarccaqc 


C3 a c;c t cagt 


ccttct caga 


gacrtt acaag 


t-GD 


qedagaqtag 






t)-*.;n - :c*c*.ai3 0c L 


t .^qqflfl 1 1 J]L 


t atqqra^r:^ 


~f2J\ 


tttc-tt 














<21Q:- 4h 
<211> ?.AJ 
<212> PRT 
<^1j> H Sapiens 




Fage 


2G 
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:\A LI (I > 4 *' 

Leu GiYi P.r-i Ely Pid Arg 3ei He Leu Tyr Al* Va I Ls-u Gly Phr; Giv 
1 Ei 10 l.lri 

Ala Val Leu Ala Ala Fh* rjly Asa Leu Lt-'u 'Jal Hftt lit Ala Lie Lou 
20 25 3-Li 

His Phe Glrj L*u His Thx Pre- Thr Asn Phe Leu lie Ala Sex Leu Ala 

Cy.-=; Air- A^p Phe Val Ely Vai Thr Vdi He L Pi p Phr r Thr Val 

Arq £er Val Glu Ser Cys Trp Tyx Phe Gly Asp 3 er Tyr £ye Lya Pht 

65~ 70 75 SO 

Hi a Thr Cys Phe Asp Thr Sex Phe Cys Phe- Ala Ser Leu Phe KIb Leu 

S5 ?0 55 

Cys Cys II* Ser Vol Asp /irq Tyr lie a lri Val Thr Asp Pro Leu Thr 

1 0-i'j 1 05 j. 1 n 

Tvr Fro Thr Ly-B Phe Thr Veil E^i Val Ser Gly lie Cvft Tl* V.sl. Leu 

115. 120 12S 

ier Trp Lhe Lhe 3er Val Thx Tyx &er Phe Ser. He L-'he Tyr Thr Gly 

130 135 HO 

Ala Asn Glu Clu Gly lie Glu Glu leij Val Val Ala Leu rhr Cys VaL 

L4S 150 155 160 

Gly Cly Cys Gin Ala ^r. l.su Glr# ftsn Trp Val I^v* Leu Cys Phe 
Ibh 17H 175 

Lrtii L<eu F'he- Phe He Fro Asn Val Ala Met Val Fh-e He Tyr Ear l.y> 
18 0 1 6 E> ISO 

He Phe. Leu Val Ala Lys ill 3 Gin Ala Axg Lys He Glu ier Thr Ala 

" 200 205 

S<;r Gin £1 Glrj PViri /inr G.I U Sr: t; Tyr Ly* Glu AiTT Val Ala Lys 

T-.r-Lj Glu Ayg Ja^ Ala Ala Lys Thr Leu Gly Tie Air* Ms =0.;* Phc 



■cZ10> 4 7 
<2T\> 6f>l] 

--..x n :•: > own 

'i^ 1 j!-> I L . i] eple ri£: 
<ai>0> 

aaL j i-:, t -q-gf_c: : q ccttactcct aegacccctq gecttq t cti tggectttat caacagctgt 

Par^e 27 
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L L^tcte i.gt 


Lttca r.t ■ijtjg 


r:al- Lf^L-i: t tt: I.. 


rjciqiJ qxi-i r: Kt 




120 




f-ct.r Agat'-e 


ggean-ttagc 


ijiag i^agc: ca c? 


a I. *qtr K ; l: \\q 


a <? 1 l - C c <t e* t 


L B 0 


d C: C a g y t : & y .=l 


t.gsgtccttr. 


a t aa c- a t -g a c 


ccaatttcct 


;=»(; I; lit tt 


li o. *.j a rj c ^ c- 1 c 


2 4 0 


satcctcttc 




ctsccataat 


ccaacatcca 


rt f;,rJ'^ a tfst. t t..=s 


aoa^a^Uia^ 


■3 0 


ccacaactt t 


t aagtqagct 


ctatgt get a 


qqt ca t gttt 


t agaatacaa 


L J Cttd,iiql:.qc: 


3l"0 




gaggcaaci-aa 


■aca-j-aDa-sgg 


tctcattctt 




cagttcacca 


JiO 


id : L lj.t r'frS f: 


>i rj ^ a m Airf m .^s <j 


«L«gL: (, at-C t 






a=3t t gcjgt ca 


4 BO 


qgacctc^.-iiQ 






ccagric-cact 


a-y f:£g Q;.^ a f: 


tL..:.:tti.i.^^tt 


5 4 0 


ttacttaatc 


3a agt ca a gt 


ttggacat-sc 


at gt caqgt a 


^. d a cct ag ca 


-Qacja t g L:t 


SOD 


■3 cot t gat tt 


t aacsact t c-j 


aqggat agict 


caat gt c*tc 


■aagoitoctt t 


t^atgacttQ : 


6bCl 



<2ll-> FRT 

■r21 3 :> a sapiens 

<4 00> 4 8 

Aar> Gin Vai Ala Leu Leu Lbu Ar«^ Pxq Ltu Ala Leu Ser We-t P.ia ?he 

i 5- ID 15 

Lie A-=n S*r Cy* Leu Azjj Prr.j Vnl L^j. Tyx VaL Phe lie Gly Wit Asp 

^& R5 30 

Pti* Trp HLi] ]lis Leu L+;ii J] is: Lsii L^u P.lrt ft J .a L^u Glu Arn Ala 
35 JO 4ti 

L*u ter til -a GiLt Ore ABp ier Ala iLe rrc Als Pro Arg Gin r^T. 
SO 55 60 

Pro Leu His Asp. Pro ll« Tyr Scjf 11* Phe Pro Pro Leu Aan Ltd 

55 iy ' 75 BO 

L^n Pj:o Ly3 GJii Tjt^u Tyr M i * A ft n Fr\> Th r S^ir Ami Arq He Gil; As n. 

BS $0 9 c j 

Lye Pre L'lri Leu Lbu Sex Glu- L*u Tyjf Val Leu Gly Kis V^l L^ll GJ.u 

100 105 110 

Tyr Ar.: n Lt?u T.ys Tyr- Ihv G l.i.i Tlsp Gl.y Gly Lyy Lys Gin Thr ArT £er 
US ]. £ 0 1 i 5 

Eftx Lea Glu Glu Asp Ser Str Pio Ara Ltfu Ly^: Gin Ly fj T^v^ Ai y 

130 135- " 1 riO 

L-ej Lex eye Aep L.yf Thr iier hi 2 LyE lie L?ly Lex Gly pxc? Ala Ala 

Id 5 1 50 1 55 160 

Met Thx Leu Cys Asn Pio Clu Bis GLn Glu Thi" Al^i He Leu Leu Aan 

Pfiqie 23 
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165 170 175 

Gin 3er Gin Val Trp Tin* Tyr Met Ssr Gly Lya Thr Gin Arq Ala Thx 
19 0 18 5 19 0 

Leu He Leu Lys Leu Cln Gly He *.L? Gin Cyr? Kia GLn Asp Fro Phe 
195- 2CX' 2 05 

fttsp Asp Leu 
j 1 0 

<L'10> d§ 

rl'll> 4 65 

<1'12> DMA 

=rl'13> i3 . Sapiens 



tjvt Kqttcnr' 




0 tOiji tiqOt-^t- 


L Ljyi.q tz»C (J^li 1 






CO 




L':y cc-qtciggf: 


c-L-yyccgc'^g 


tygtcttgct 


cigcc r.r.i.gv. tz 


anL: l.-.ji; -tt.'::ij 


L20 


cc ccca acaa 


l t tcgtgct c 


ctqgcqcdca 


tcgtgagccq 


cctgttctac 


ggcaagagct 


t fl i: j 


actaccacgt 


g:t acaarrct c 


acgct gtgt-c 


tcaq ct gc=ct 




ct ggaoccgt 


2 4 0 


ttytttEittn 


£:t L" ttlULTtCli 




fti±t:\.L\tzrif;;\ - 1: 


HCAjiji^-t.i Lai 


t- h rrrjcj-t^t r .^^Q 


3 GO 


p.lt.Tij.q't.yj 1 tjOO 


£ .*i q ;i q « 0 i": 


LjLq y MCaCQ C 


.g-tcqcrgAgMg 


czr.T.c.X-W-Xr.cz 




■3 50 


[■:qL ir;c":g t-y<"y 


LztCCCiiSry'DC C 


cjgtgcgcacc 


c. tcjaagigg.it. 


ggagy g&gca 


ai":i':agg f:t":-r:y 




•IjCC t CC & ga y 


gcaggagagt 


gtqt t ctgag 


tccc-ggqggc 


gcage 




J 6£ 



■^•ii> ice- 

-?L2> PRT 

Lf^ij Pht Thr Jll^ Thr lltr L*u L-y B LEfU Leu Fif q. Thr Gl'J Gin Aid ]]U 
i :-. ID IE. 

£iy Arg Glu Gin Arq Arq; Axq Ala Val CLy Leu Ala: Ala Val Val 

20 " " " 25 30 

LtiU Al* Ptin V.nl Thr Tye, pin* Pro A -in Phn Vyl, L-0-T1 T.cii. AL a 

JO 4 b 

Hie lie Val tei Axq J_*u rhfe Tyx Gly Lya Sfer Tyr Tyr his Val Tyi 

5.0 5 5 6 0 

Lys Leu Thr Leu Cys Leu o->i Cyr- I-Ou Asji P-sri Cyt, Ley A*p Pro PhE: 
£5 7rv 7 5 T50 

V,>1 Tyr Tvt PTiMr Ala fittx: ,M g Gin Pht Girj Leu Arq lieu Aici ulu Tvr 

90 95 

Pag* 2 3 
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Sly CyS PiT-y VmJ Pit. 7*i rg Asp Thr Leu Asp Thr Atr Arcs Giu 

1. Dl> 10b i.io 

Gftr: Le\j Phc Ser Ma Axm Thr Thr Set Vdl Al"cj Ecr Gl u Ala Gly Ala 
Ll.S L^U :i?-. c . 

his Fro Glu Civ gIq GLv Ala Thr Ar q Pro Gly 1.i=h,' Gin A t ij Gin 

130 ~ 135 Iaio 

t^lu Ser Val PJ-j* Val [ J i? Gly Ala Gin Ala Ala Pro Pro Glv L*u Arg 
13.5 150 " 155 160 

<->K'> =►! 

<i'll'> OH A 

<2I3i- H.sapierifl 

ttacttattc tgecctttat ccaactttta attcc-crtttg ctattctcct gccl rat r. t L. 6 0 

ct gr.'^ct cat tttccctatt ^tcctqcctc yea t tqnt c-a aqggatgagg ptgqcaqgat 1 20 

n.r qc-Afi£i>^& wffggtccc^ tgggeca tga &a^gctr.t:l.<j Ljac-U l.^ftwcc tnc^gacact 130 

cc ca c t ct i^q ct geeggc-ag gqatqqaagc tqadtaaqca gqeacgegct -qgir- 6^ I. t>^ijL_ ; ^*J-!.j 

gt gejagagee atagqetatt gggcstggaca rgsttgggtg cctcat-gggia* -gctccccafc. UKi 

ggacctgtgg coccttggg^ c-ctcttattt ctcaccccaq get: 1 1 c ^egg qag.iggf.tra -i^O 

Enitcagao<ra t gccc-oa<r«3g ^teraegtc^g c c?-t g-ggt qg caqcctgttc ctcctgaatc J 20 

t L'qc-ot t'.;t t nqt^^y lo. tq ckjc] a p L ,: J N r 't i-^^iu^cq^tc- tgatrctgee tgctg-g^ccc 'ISO 

Ctyq^ggutsj t c:ttccdf:t£c ttectgc tut Q-tgccL Lczac ixiqqM h^go <jttr»3iincot 540 

tccdcctcta cctgctcgct gtc-a^gq tct tceaoaccts cttcgogcac: La^t.r.coUqw 6 CO 

a^c r. 1 1 :s 

::n> ise 
v:l2> prt 

O. ].::>> Id . p.-ap i ens 
<-yOG> 5i 

Glu Thr Tyr Ser Ala Lbu Tyr Pxo Thr Ihe Aan ber L«?u Cys Tyi ser 
1 5 LD IE- 
Pro Ala S^z F>ir, Sul Gly Lou 7 I.e. FJi* Pm< T 1 r- Tie 1-cu Pre His. He 
'•<] c!D 

Ar>f> It La Gly Met Are Lieu Ala Gly Ser. GLy Thr Mis Arc; Ala Pro Tip 
35 " -10 4 5 

Ms Met Arq Gly sbt Trp Thr Thr Ser Gly His Ser his £er tly cys 
SO " 5 5 SO 

Pa 7* 30 
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Arc? Gin Gly Trp Lyr- I-olj As.p GJlu Gin ^L-;i Gly Ala Cly Sex Cly GLy 



65 



SO 



Gly Glu prt> JM* Tie Gly Vul Asp flxg Leu Ely Cy>; T.fcu M^i- 3-L y flip 
£ E> 90 &5 

Pro His Gly Sex Cya Gly Fro Leu Gly Pro Leu He se-r Wis Ira Ki-q 
100 105 110 

Uu Ser Arq Glu Axcr Phe Lys Ser Glu Asp Ala Pro J_.ys Lie His Val 
115 120 12S 

Ala Leu Gly Cly S«r LVeu Pht Leu L^ii Asn Leu Ala Fh& Leu Val Asn 
130 1*0 

V.hI Gly Sftr Gly Per Lyfi Gly A la A ^ Cys Tr F ; ' AL* Arn Cly 

us i&n iss Ten 

Ala Val Phe Mis "J'yjc Phe Leu Leu Cys Ala L^he Thr Txp Wtt Gly L«u 
16=. 110 175 

niu Ala Phe His l.-mi Tyjr T.^v Lpli Ale Val Arm Val Ph© Asn Thr Tyx 
1FJ0 ltk c - 190 

Ph* GJy His Tyr Phe Leu 
1?5 

<210> 5 3 
<211- 33!: 
-:2l?-- I?'ftA 
=*21? : - H . Sapiens 

.•i^LT.gsjt.r-^y ^.iQtgtaac; tgctt^saat ijc^ggattqd aaL-cat.. n ./ir:t ag^s^y t Ktr: t«D 

L3ddCac3'jc csgca ca gec c caa.a gcea a acactatgta cagaat.:acc coqijat ijor-g 1 : U 

ycgaqaaotgq gattttcsca caggscccat tcacgttcqc qtayca^agc rxcacfl^coa V U 

ccdgcagqga tgaattgctg ct cataaege tggtdtttac atatggsgas attttqtcct :>J 13 

tqttqattat cacaaaaaat acaggattgt tcctgatttt cattgctcct gcg gas a aa a liOt) 

<2it» 5d 

v2Ll> ill 

<212> P K'J' 

<213> K. Sapiens 

c-4 0 0> 5 4 

Asp Kir. Phrt T.ttu Trp Hi a Prv> Gly Glu rye Val FJ/i* Phft f^r Mm Gly 
15 10 IS 

ALi Met L.ys rie Arg Asn Asn Pxo Val Phe Phe Val lit lie A an iiys 
2 0 2 5- 30 
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Asp Ly3 lie £er Pre Tyr Vy.l A^n Thr Sex Val *ist Sbt Sex Asn Ser 
.35 AO 4 5 

5cr I-on Lt u v^l Tiltf Val f?.i j'i l^ii Gyp Tyr A.l* Asn v L ii Gly Esr 

50 CO 

Cy a v *3. J-Y* 1 &>ly Ar '=i 1 THx T.,t»y Tyf He 

val fbe Ely rhe Gly Ala Val Leu Ala Val ir-he Gly Aan L&u Leu Val 

ttet He Ser He Leu Kis F'he Lys Cln Leu His Ser Fro Thr Asn 
10D 110 



-HO-- 55 












cacatcttaa 


caagact qaa 


a a^cat teat 


ttgtttttaa 


tt. t Q&agagc 


aatttatttc, 


60 


ctattcattc 


a+agt ct t 


ttgattttt* 


s^actcott 


t *7- g c 1 1 rj n t a 


sttttaaa gg 


120 






4; a I.. n 1 x t "t"- 


idtPtti;i.j 




t-r-n tqirt t gc 


1B0 


t-:jH^-;:1, fit 


r.t;;T.«rti5i icct 


gar:cagttac 


aataarttt.r. 


aLtgr.t.l- 1. t>:: 


t ,rd)^J ^ <J IJ;^ TJI - 


240 


Cjt a a.a el t£3 a 


gir-cit.aciat.ca 


aaggattcat 


ggctgagtta 


t aa t aagcac 


ac ii'di L _ ag ca 


3rio 


tcatdaatac 


agcicaggggt 


tataaagece: 


at^aacigcat 


caa 1 1 d3 1 ga 


atnaaitiTcta 


3GD 


tdtnqta.dCC 




^D;d.> K i] nt- 3 t^<:- 








4 2b 










/il.-g .d tt yt. 


CM", QiC: a^." ttf 


4 BP 


l> 1 . id 1. . 1;. t h t 3 


r. nr.tLttcgc 


ctgtcigtcta 


ljcl a caiga a 


atatcjttac-2 


a. t_ a l: hl] a a f . t 




.d t. Clctt. rl dt d id 


age, t a ggta t 


aaagaaggat 


agdd&atctg 


r. c a a c a 







<210.> 56 
<21\> ie*o 
-212> PRT 

< 2 1 -3> H , Saf : >i rtn ^ 

-.4 00^ !tj6 

L*lj Thr A&p Phe Leu Sei Fh* the lie Vro Thr i' : ti& He Met lie He 
l 5 10 1!j 

beu Tyr Gly Asn He Pin? Leu Vn.l Al* Ar^ Arc? Gin Aln Lys Lys Ho 

2 0 : 7 -- c -. " 5 0 

Clu Asn Thx Gly l>*r; L y Hi Thr Gilu Sfcr Rt-r 5fM: f* ] ii i T\?i LyS /iirt 

F\Tge J 2 



WO U 1/364*73 



pC"r/usuo/3is«i 



Arg Val Ala Aig Arq Glu Axq Lyr Ala Ala Lys Thx- Leu G 1 y VaL Thx 

Val Val Ala PJjb Met lie- S*?r Trp Lwu Pro Tyr Ser I Le Asp Ser Leu 

6 5 7 0 7 .S 3 0 

lie A-;p Al« Phc Met GLv Tie Thi- Pro a.l* Cys T 1 rt Tyr G In II & 

85 3*1 

Cys Cys Trp eye Ala Tyr Tyr Rsn Ser Ala Mfrt flsn Pro L^u lit Tyr 

IOC- 105 Ltd 

Ala Leu fc*he Tyr I to Trp Pb& Arq Lys Ala lie iye Val lit Val Thx 
115 120 12*} 

Gly Gin Val Leu Lys Asn 3er Ser Ale T/hr Met Asx Leu Phfi Sex Glu 

130 135 140 

Hi*; TIB Alcr V*al Gly Thr Lys Ffrc Aru Pro ^r?u Lys Leu Fro Sex 

1J5 lhll " 2^*? ICO 

Glu Met 5cr Phe Ly£ Sex Bex Lys Thr Met Aan Glu Gin lit: 7-L.^ri Cy^ 

165 170 17 5 

Sei Ser Asn Lys Gin II© Asn Val Fhe Gin Ser Cys Asp- Val 



<2ii> ^7£. 
•iZTlr- DNA 
<2l <> n . Pflpiene 

v4ClC'> 57 

tttgtqqcaa ggagaccctg ottcccqgtct tcctqatc-~t tttcattqcc ctggtcr^ggc 60 

t qgt.->qnn^n e-gnq fc-t t q *" 'H.qq £t I i>[_>q«_:t t<: C Q ^pt'j&gOd'K' iii?fMj<;cttct 120 

K i q 1 1: : t a o q t c c t o a l;. c. l : I - q g rr r: g g g c c g scttc r r. c 1 1 • - r t. t: ci o 1 1 t - r: a l ; * I; < « I ^ a 1 4s p 

,«>t. tutcc t ^gL qt.AOCLciqt dd Littct t ct Q'ttCLdf-Ctc .ibLrjjatttc: rrr:t agtvt.r.^t. ' : : , 0 

tc-riccactgt qatgacctqt qcctaccttg cagg tret gag catqetga^c accgtcagca 3 AG 

ccgagcgct g cctgtccqtc ctgtggcccs tctqgtatDg c+ ncc^cccc cccagacacc 360 

tgtcsgcggt cgtgtgtgto c-tgctctgqq ccctg tcc-ct -actget garic atcttg^aag -3 2 0 

qqaaqttctg t gn?c t totl.J 1 1 1 s^Uja l.-- ; ^.lO+ctqcf '.. K ^q b t-it c,i at** tt ti?at t 4 BO 

t^Litc^i::l..qu .e>^t:qT.oq;cr.y attttttt^t t catg tftt c j t '/tgtgogtLTC « ytct y qcc l: .Sfl[> 

t^Lihyqr.r;,^ gatcctetgt ggctccaggg gtctgccsct gaccaggctg tacctgacca fcDO 

tcctqctcac: agtgctgqtg tccctcctct g-cggcctgcc r:tttggcatt cagtggttcc bbO 

taatittstc: g a t c t ggaa ^ gn 1 1 ct.r?« t g tc 1 1 n. 1 1 1 1 n I w t ^. t X w t -.vj^g 1 1 1 v=n 7 2 0 

ttgtcntqtc at r-t '> I, ti? «r: « gr:rtyT. Lv--- « AC^irCfitCdt tt.attttttc gtgggct.o.tL /KO 

l^age 33 



wo m/>&4*?j 



pt:r/Liiiatty3is«j 



tt^qqa^'rrrj.s ijt q i'?C Q r \ E t CdgC&£C<;g& L<:C r]<fl est C t y.L.v:j q t.: t- CJC 840 

fsytj.dCi^ Kt. u[c^ t g^Liqtysjat ci-r:a gt gaag QStgcttccq t.ujiCiggc.jLCC figgag,^ t.. L:. c: v i 300 

aaqaacjcatt ctg-gjigt-agg gatgqacccD tctacttcca tea tat at at gtggettt§.3 &*i0 

gagqcasctt tqcccc £7b 







5fl 


<:>\ l 






<£1 2 




PRT 


v V- 1 ^ 




H - Stip i.^r'j.^ 


<2 20 






<2 2'L 




UWSURE 


<2?2 






*2?3 




>J a a is Urj x n own 



c 3 00-^ 5 G 

Cvj; GO y l,y£ CI 111 Tbx Leil lie Pre- Val Phe. lieu Ilfi Leu Phe- ^1^ /i I -3 
l' B 10 16 

L.eu Val Gly Leu Val Gly P.srt Gly Pbe Val leu Trp Leu Leu Gly Fbe 

2D 3ti 

Ar^ net Arg Xxg Aejj Ala Pb© Ser Val T'yr Val Leu Ser Leu Ala Gly 
J 5 AO 45 

Ala Asp Pbe Lci.3 Prv* T jflV C>-; ^lr' 1 t1k Cv* T.^v VeL Tyr 

c iC' -S- L > t>0 

Leu Str As/j Fhe ir-be Cys Sex lie Ser lie Aan i he Pre- Sex The Pbe 
tiQ 7 0 7 5 y0 

Thr Thr Val Met Thr Cyg Ala Tyr Leu Alo. Gly 1-eu S^r Met Leu Sex 

6 5* ?0 *5 

Thi Vnl 5rx Tin ftr\j f Lku Hftr Val T.fri> Tip Pre- Tie Trf* Tyr 

mi> 101. 11U 

Ai-L, cyft Arq Axg Pro Arg Kie Leu Ber Ala Val Val Cys Leu Leu 

115 120 125 

Trp Ala Leu 3e,r Leu Leu Leu Ser II* Leu Glu Gly Lys FJic Cy? Cly 
130 135 HU 

PK© Leu PbG At^p fil y ft-Sp ^kl' -Sly Trp Cys Glrr Thx Phe RAp Pb»=! 

145 150 15£ lfiD 

lie Thr Ala 7vi;i Trp Tie Phe Leu' Phs Met Val Leu L'ya Gly Sex 

165 170 175 

5rr=r l-r-'J AIa Leu Leu Val Axg lie Leu eye Gly Sex Arg CLy Lau Pxt> 
lfiO 135 ISO 

Leu Tbx Arg Leu Tyr Leu Thr lie Lry leu Thr V^J T^u Vj-sI 5k r L«u 
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1 £ 5 2 0 0 2 0 5 

Leu Cy^ Giy Leu Pio The- Gly lie Gin 'Trp Phe L<?u Lie. L*.u Trp lie 
210 2lb 220 

Trp Lys Asp Sex Asp Val L^u P>in C\ r ^- Hi-, lie His Pro Val Sgr Val 
225 1-3 0 235 240 

Vol !-**>■■ Fcjt L^u Aitfi ^e / Sir Ala Aen Pi o I J ft He Tyr Phe Plie 

iOb 250 255 

Y*L Giy Per Phe A.ig Lya Gin Tap Arg >laa, Gin Hi£ l^to ILe Leu Lys 
260 2G£ 270 

Leu Ala Leu Gin A*:g A Lb Leu Gin Asp 11* Ala G1l- Vnl Asp >1lg Ser 

215 ' 2 GO 265 

-Glu GLy Cys P-he- fiirvj Gin RJ.y Tin At-l: Aro P*.« Gli;- M.fl Phe Ti-p 

2 90 H> S 300 

"y.s Mxn A-?e> GLy Pro Leu Tyx Phe Hi 3 hif II* Tyr Val Ala Leu Arq 
" 3 L Li 315 320 

Gly E'he Ala 



<?L1> 57? 
<212> L'WA 
<2L3> K . Sapiens 



ntt.t.gow cct 


cactqt tg aoj 


Cd^ac a yict 


qctqaa^qtt 


gt c get gacc 


a cc-d cat at. 3 


60 


gt aac^gqt t 


accaaaggtg 




c-a t aa t ggt c 


tnrasdCCuit 


gt -sarct tea 


120 


Tggat ctgat 


t cteaat gg? 


acaaot rat t 


g n a<a gc£ g'tfc 


t garvT-it t cr]^ 


tcc-t 7 a. 3 tga 


LE:C< 


ccc i c-s a '^o t 


ntgqao.gggt 


,t ti A.;* j ti t . 






iny.js^t tt a 


240 


^rz<-K tv'Tt-.jr; 


L L Lr: 1-Lj'_ L t .i 




■:'d c;t 1 1 cifd § 


tccatqQ^t.j 


a-Esagt gtcgi 


: L ; no 


tidt cq tqgt 




gt cact at ca 


"caaggqgag 


gcagaaagt-a 


ettgeagtea 


360 


ddfitcdcgt t 


gtaccacttEi 


a t agt at t era 


gl tcatccro. 


-ictogtgo.gg 


tcgucsaoagg 


4 2 0 


ctga t ct gt t 


gntuL't- jt tci 


gt l.r: 3 ,j U;i|..q;d 


L' i r^.-ii^ t'-.'i q L 




M lit"! .SlCOZi 


4 GO 


rjt.L.jLlJTi.-l t. i.J t^t 


r.Lj.'js f: S r':Cti L:£, 


qCl^CS Qi m ■- "t a 


C ri t il; C a C i 


tczqagt t ttg 


tgaatgy 


S<iO 


L'.f vi.'-irLr^v" I - V 


COgtj'tgsa tc; 


a t ca cac^, gt 


■a LjCq^aag 






r i 7 ft 



:210> 6 0 

-Zll> 2£«2 

■21 2> C EmT 

::213> H. Sapiens 

:4UU> C-0 



ON 5 OCX • W (il itjJ/.^A^TI - 
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Pho J\i,Cf *Iyj Cys Val lie ILe His [to Met Sex Cys Phe Ser lie Mis 
1 5 10 15 

ljye Thr Arg Cya Ala Val Val Ala Cys Ala Val Val Trp lie lie Ser 
2 0 25- 3Q 

leu Val Ala Val ILs Pre K^t Thr: pk« 31c Thr &rr The A^in A.r.g 

35 JO J 5 

T'hr 7ssn Atq K^r 7\1a Cya L*:u Aep Ltru "J" hi* Ser Eer Asp ulu jl.ku A^n 
5 0 55 tO 

Thx fie- l.ys Txp Tys: Aan L*u He Le-l- Thr Ala Ser Thr Phe Cys Leu 
6f. ' 70 7f. 60 

Pro L*u Val lie- "^1 Thi- T^u Cv* Tvi Thr Thr 1 31^ rit? Hi* Thr l..tt< i 

>:|j e=i :■ 9 Ei 

Th t Hi-i v L^ 11 G Lri Ti " l]C '^P r ^ 1^ Lys Gin Lys Ala Ar£ Aig 
HJCr 1C>5 LID 

leu Thr lie Leu Leu Leu Leu Ala Phe. Tyr Val Cy^ Phe Leu Fro Ph* 
115 12 0 12 5 

his lit* iieu Arg Val 3Le Gin A:?p Arc He Ser Alt> Cys Phe Cln Ser 
130 135 " 140 

Val Val Pro Leu Axq lltr- 7\i-Yj o<ri Met Ivi 3. ho 7 hi PI i ^ T 

14 5 1^-0 1 5-^ 16' ' 

Hi. 1 -; Tyr- A.lrt M h Leu AiSin Thx Phe Gly At Ti Le □ L^U L^U Tyt V«=il Val 
16L " 170 17E, 

Val L^r Asp Asn The Gin Gin Ala Val Cys Ser Thr Val Arg Cys Lys 
1B0 16^ 150 

<21C> 61 
*211> S72 
-cZ12> PWA 

<40D> 61 

gi^gagggctc gt a galeae ta^ccct.icc ctttctgttt ctt-cotuostc tttcctt toc fiO 

atctgtttct cat ijgt est so t^U^t-OlC tttl.^i: ct-i^l:-:*- i-f'^.l:.nl:t Let c^r.r.tc^t.r.^i- LZO 

tctttctc-^t L'v!oi.:K':n^t-t t-t-L^t^tc^ i tctiTditLrci tt tat at egg tggccacttt ISO 

tct ci.t-.-.-t-cr t tql.t.rt.dt.rt ctctctctct ctctttcccs ctttgtctct qcdcqcctqt 2 '10 

tptr?l.l.ttt.c tgcrctgtctc t ctcrt tqece tcatctctct i?tctctctct t geect cat c 30 0 

ict.ctgtctc tc-tgtgtctq tgtctc-^ccc. c-c-tc^ttccr- o.t L k nc^^iji ^un^t iTt.Hin 

ACjgacaactc atgganccc-o cc-Dgr?qnii:-co5 L^^tqi^t.g'a ^Lzi^cr.A^y^ a 2 0 

ttgqcaqtaq cccctciaccc t Ctf'il ttl^u ~ CrtflijaL-tac'c c<qayags: crt q agqaggtgag '38U 
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cggccictctg 




ccqch t c-agc 


ttatgtgaag 


ctgg tact 15c 


t gggact gat 


5 d 0 


t o.t gt cjc-gtg 


nnctt'ji.Tcqg 




-ttntccctg 


ctgq tgct c-a 


aggagogggc 


6 DO 




ijCL.rr:t;i.;si..::L 






c t gci cc g -J" 1 L i 




b60 




K. tc erect tt-g 


tgct-ggctt c 


tgtc^cqiiiTirfO 


ijijt.Tt. r-ttca 1; 




7 2 0 


tacsctcagc 


tgraaqatt-g 


tggcct ttat 


ggDcgtgirt c 




atg cogent U 


7 HO 


catgct gt tc 


t gca t c-Eicjcq 


tcacccgcta 


catr^ccd t c 


gccca cc^lc 


gctt etaege 


6*0 


caagcgca tg 


acactctcsga 


c-a tgcrccggc 


tq 






572 



<- y<)> fj?. 

1 1 L V 1-13 
2 1 2 > EFT 
<213> *Kb-apier<& 

Mrh Mn As» Thr Thr f>ly '51 u Pro Pl.u fiiy V& .1 of-i 01 y Ala Lcii. Sor 
1 5 10 l c , 

Pry F'T.r> ^ftr' /'.Li fifif Ala Tyi Vi L Ly-i LeQ Val LmLi LeU fi.l.y T.*u I W; 
20 25 i0 

Her. Cys Val Eex- Leu Ala -Sly Ajeti Ala; lie; Ltu i]tr Leu Leu Val L^u 
3E> 40 45 

Lye Gl'j Arg Ala Leu. Hie Lys Ala Pro Tyr Tyr Phe Leu Lexi Asp Leu 
50 5 5 * e. in 

Cyc Lcl> Ala A.5]> Gl-y Tie Ar.3 Sex Ala Vsl Cys FTie Pr-o Phft >> I T,r?u 
<f;=. VU j ,c 3 &0 

AIa r^r Vrti Arg [Jib- Gly E-er Trp Thr Phe s*i Ala Leu Eei Cye 

& 5 S*D 35 

Lya Zle val Al-3 Pht t<iet Ala Val Leu I'he Cys Phe his Ala Ala rht 
10 0 105 110 

Met leu Pb* Cys lie Sr-r VaL Thr ftrrj Tyr Met Alp Tie Al^ Kj.y Bis 
115 1^1] 

Aid PL* Tvr Ala Lya Atg Met Thx Leu Trp Thr Oya Ala Ala Glu 
120 " 125 140 

■;2U2-- DMA 

•-£13 -• H,Sapicii:s 

•.: 4 0 n :- ti :> 

aaapat.l'-^rit qtscrtgaact at t>j|aatgga acttggaaat aaaqtccctt ccaa aat aac 60 
LiLtf:r.tca£ cagagagtaa taggta^atg tttt-sgaa^t: gaga-ggactc aaattgccaa 120 



BNSCi'OOlb ■ VV 01 Jti-t/ jA^TI 



PtT/LiStJ«V315«J 



tgatttactc tt tt-it ttt t cc-t L-7t agci t tt^gggata agtatq t gca «*a t ^aaanaa t IB ft 

a a aoa t qiaqn ^rr^f^Ctgta ^COt-j^ t t^t ^s^L t tqqga aaaags t a«u ti^acncaca 240 

■^■ariP'^nri.-siKi t H3..s<i*t.ci.-tt- t ijflC&'jcc^-t. C .^y-^ju^ t-y^lj ij c-ccttt 1 ncu 011 -«<-:ji:. ut^m t 3 DO- 

UiitsU'tCi; t^t gt ^dM& cicaact Ljjc-t c aa^t y dtqt l: c-'jl 00 L t ck 1" r.y't^car.Tttt 3bn 

£atggtgttc ^taattctga dc-b cact cqt tggcaatctg atagt La Ltg tttotoLat.i^ *iV;CI 

acacttcas-a caacttcits: ccccdiaca^a ttqqctcaitt cittccatgg ccact gtgqa o$n 

ctttottctg gggtgt.ctgg toat^cct ta ogt n t ggt'T agatctgctg agcactgttg ---10 
ot 1 1 I; L t n t_> I;, ol: t at 4 t-rjs & 1 t<:« t: ao.-j .d y cv^c r_i. » tt a t gc- tq n.gct cag c 00 D 

(■:f.f;C3t„ 1 1 I---: t ttCftOt t tCdtCtQC£t t C,^ CgCti £ tatQ'Ct g Lgt q': ,y-;*L ticket fifiO 

gagatatas.a qrcaagatga a ta tct t ggt tatttgtgtg atqatcttca ttaqttqqag 72L1 

tgtcc-ctgct gt 1 1 1 tg-^fi t ttggsatgst cttti^tngag ctanctct t en ancrgcgctga 7Su 

agagjatat at tacaaiicjtg ttoiA-'L -iiucsg nngOijil tt gs t'7l ijt-cfctct tt ug-saD.Q-at 34 0 

atctgcrggt-a cti.?*Oi?tt.t» t^Ul-.ttH-.l:: t U*i « tA<:. -;t q-j* lot * t t-H tgt t* tntgt ?0D 

<;t titt^O^o taf.r.lKia. tc^taaAga .%.r-:i.jg ciiqa tt^attagtg at yrc^.i.^T -CSi "*f>W 

-21 1> 2jB 
-212- PRT 
--2 i 3- K . 3 api c n 

ftx"> Rlu T. V r, Thr TL^p Gin Prf? C-Qy f-tet Met Pro Fbe Cye Jlis A an 

]. 5 LCi IS 

ILi? life Aan Lie- tei Cys Val Lys Aetj Asn, Txp Ser A^n Asp Val Arq 

2 0 25 3D 

ALa £e:r Leu; T'yr Srjr L«« W^l, V« I. L^u lie T it; L^u Tin: Thr Lftu Va 1 
3 j -y u d E. 

Gl y Asn T.ttu rift V.=bL lie Val tei lie .ier hid t'he Lya Gin Leu His 

6D 

Thr Pio Thr Asn 'l'rp Leu lie Kis Ser Met Ala Thr Val Asp E'he Leu 

L*u Gly Cyt Ldu Val M*t Pr>i Tyc H^l. vai ft/y Al« Giu His 

Bj to 

Cvs Tr[] Tyr; Glv Glu Val The Cye Lys ii-s- Hi a- Thr Ser Thr Asp 

lou 105 110 
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lie Met Lit; j zser £er Ala y?r lie H±s Leu Sex Vhe lie Eer 31* 

11 [}. i20 129 

jLLsp Axci j'yx Tyr Ala Val Cya Asp Pio Leu Arg Tyx Ly e Ala Ly£ M^t 
13 0 135 1^0 

Asn He Leu Val lie Cys Val M*t Ilc= Fhe He Ser Trp 3er Val Pro 
145 150 155 160 

Al-a Phe AL3 Phe 51 y Nr,t Tlii Phe T.nij Ciu Leu Asn Phe lys Giy 

16.=. 170 175 

Ala Glu Giu lit Tyr Tyr L,ys Wi3 Val Cys Arq Ely Gly Cys ohtr 

1 b U 16 5- ISO 

Val Phe rhe Ser Lys Lie Ser Gly Val L*u Thr Phe Met Th,r Ser Phe 



lie Ala Lys GIj Gin Ala Arg L/eu iL& Ser Aap Ala Asn Glri 
225 230 235 

^2L0> 65 
-2Li> 101S 
<2L2> DMA 
<2L3> R . •?:•■»& 

<4 00> 6 s 

.d^acitcucg gutg^aacct ^ggcatqtat attttqatt^ ttttatgeat iLtcctacitq £fi 

dflqddccact gtcttgetca ^at-agaaqca agatactcd? acttagtttc tctgt a get 120 

ctgettttta ttbttcctqq ttggattgca ctactactca gtttctattt tataat actg 1£0 

attateaaec atgqc,aggga aataact t tg tattggtttt tatggataat ttattafcgtg 240 

tcctagactc tggccttgtc n a-Aj<r^,jqg0. c^jtsaijaag^! crrcgat^tat tatacttcigg 30 0 

□ atgctaij^a q^.cjiJi::l.qar.>-: T.c;gl^ 1 1 1 C:C: sr crqyao.j;i ggcjae$f-46l. U' t.^KL^Ois 360 

jiAr.af:Agg^A tcci^cagat ggcatcagag sacfictataa aaaagaaacg att tgcaaci 4 2 0 

gccarctct? ttcDaeaaca attccttact tptgtggtct geaaggeggt tttttgaatg ISO 

gaficagaaca tagtaatata gpaaaacaca at gat gag na aci^wnrjcaa ntt cac-nn-ct 540 

g t tg g gq a a a ag ca cact 1 1 t tct q<] t-g t a o t i:^ .3 y La m L, U+o L t q £0 0 

gt^c-n.gqt.tq .^iTt^ trjuiitt fttccaaaatg tttgaaac^a gaastgtttt ggatttcr^ga bGO 

r.r.trregaata tttacacatt cataatgata tatcttggaa atggttcocii agtctannca 720 

caaaatttat ttatntttco tatiicaoutt t ^o^tai- Ul-qaa^te 4 U 1 L i.gr.A^a "?BD 

atatttt^aa ta-attttrjQg ^^rjflAiC-ai a gr.t t gcat i cattgascca tcaqicagca 34 0 

aaagcttcan q-tq t yq£*tf. t.tceacttgt ggcatcatgt tgatgctcaa aaagttccat 9 DO 
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attttsgaqc atttcaaatt ttqgattttc a a attache* te^ttaacct gt attta g.=sL. HtO 
gttaactaca gtgcctcttc cacgrgcact ttcaggaagc attcttttat staagcrc lOiti 

■;21I> 327 
-212:- PRT 
<213> H. Sapiens 

< 4 0 H > &h 

T"r He Lya Glu Cys Fhe L-eii Lye Vai. Fro Vai Glu Glu Aid Lfiu Tyr 
/ " 5 10 1-' 

Leu Thr Ser -Lys Tyr Arg Leu Sex lie Cya Asn Leu Lys He Gin Asn. 

T,rn Lv-; Cv-i Scr T,y* lift Trp Rsn Fhe L«y s*i r.Je Aeri tf^-fc M-ut Pre 

-an ^ 

Gin V«H Glu Asn Ser Thr. Pxo Glu Ala Fhe Ala Vai Trp Pht Asn V* 1 

Cvs 1.VS Leu Cys Phe Mat Pro Ly d 11* He Asn lie Vai Gin Asn Tyi 
d 70 75 SC. 

Pb* Gin Thr Mr?t CvS II* ^*<a Cys Tl« J\^n I J* T..y*; Plat Cys Vai 

£S 30 35 

Thr Trp nlu Pro Phe Pro Ara; Tyr He He Met Aan Vai lie Vhe Arg 
100 105 liO 

Asn t^rc Lys Ser Lya Thr Pb«- Leu Vai Ser Asn He Let* Gly Lya Gly 
llf. 120 125- 

Tyr Ser Thr Cy* Thr Thr Va5 T 10 Leu Lev Leu Thr Phr? Thr Pro Glu 
130 1^ 

Mrt TLrru Ly^ VtO- ny* Ser Pro Thr Gly Vai Aen LtLi Lei: Ala Fhe 

;i^S ' 150 155 160 

T.^u He lie VaI Fhe Ser Tyr He Thr Het Phe Cys Ser II* Gin Lya 
165 170 175 

Thr Ala Leu Gin Thr Thr Glu Vai Arg Asn Cys Phe Gly Arg Clu Vol 
13D 1S5 190 

Ala Vai Ala Asn Arg Phu Pht? Ph- HO VM p]it; .^r Asp Ala U.fr Cys 
155 " 2 OS 

Tim 3 l * P-t'O VaI Fhe vai vai Lya He Leu Ser Leu Ph© Arg Vai Glu 

215 220 

He Pro Glv Gin Ser Leu L*u S<sr Phc Pro Ser T-1k l ie H '■ ?> ?ii 4 ft Us 
225 270 2:^s 2d0 

L'be Leu Arc; Pre- Ssr Phrt Asp L-ya Ala Arg Vai Asp Thr He He Ms 

Tsqe 'J 0 
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2*5 .i5 0 255 

Ly>> filri Ty* Lys Vai lie [:er Leu Fi-o Cyi Phe Ii^ lit: Eftt- T l.-e 

2 6 u 2*65 2 7 1 1 

lie Lys. Lye Le J Ser ier Giy Ala He Gin Pro Giy II* lit? Ly£. r: 
27 E. 2 a [i 2 35 

Aiq Sex Ty.c Axg GLd Thr Lys &ei Glu Tyr Lea Aii E&r He: Ala. Ar^j" 
2 90 2S>5 300 

Eis Trc Phe I J he Thr Arc Ser Mst His Lys Thr He jjys lie "J'yr Met 
30 5 ' 31C- 315 320 

Pro Arg Pho Hj.;; PtO Gj.V L.i=tLJ 

<210> b7 

<211> 1251 

<212> DMA 

<213> h-Sapiens 

<400> G7 



c> Ct Oi C-C«st CO 






Qq iS-V^' "-Ki 17 r' ■ ' 




g^ttqntgat 


£0 


L? «1 C 1 L 2 h. 1 i 1 - 1 : 


i-jtj.CL'CddC' C; 


t u g c t ggg 


acgxit.:ttcc 


tggtgtjccoL 


C-CtL-Cfr.r VI. 




gggctgceag 


cc^tgggtt 


gatggcgtgq 


ctggcc-ggct 


cccaggcccg 


gc-at ggaget 


ISO 


g q C Si Cqcq t C 


t q q "7- rr c t qct 


L r't'Tf *7 J P q p 7^ 


1 ^ h ji r 4 * 1 * « ^ 4* +■ 




nt t cct cq CC( 


2^0 




t^-^i.T^ti^l- 


Si i -.^^ m l\.f.f:iliq 






t^:"]ijri,^ r.if q r: l\ 




H T- C t q r ' -7 i.] C f 


tc.:t^i::l .^i-l 1. 


ct:L£ir.g^gD^ 


LiLgtcotaf.t. 


C l; L gc r'.." V. 


OLI.tl.'J.L"-: X tj 






gn.-it.L-.'CflL'cg 


c tgcctgctq 


gcgctqtgcc 


cacact^c -t a 


c<j ctg-gc-.." ac 


•i 1 .' ' ' 




gr.:t gccc-ct 


ctgg^t^tgc 


gcc^gtgt ct 


ggc^tqctqgcr 


cj cact ct t c 


■] l; rj 


^qcgtucc-ct 


ggct ggtct t 


cccc^sqgct 


geert c tgc,t 


ggt acqacct 


qgtcdtctgc 


:-'jij 


ct ggactt ct 


ggq ?■ c&qr c?o> 


tjHagctgtcg 




t rjntf"l«1 1 C-7t 


qggnggct to 




ctacct 1 1 re 


t viW i:gc:t 








i: t. gt 0<J Cft L-t- 


h'r^ii 1 




L-* ;^ - q C;.*isj L"=r"; 




cgggc.ottC'ii 


Ci-'C:gt L-qc 


Cd gC-.-d LILlil t f. 


.1' ! '.1 1 


rt- rj|.<j-;.i q<: i : L 


a L gt got-: ct 


ijaggc tqeez 




c ireaq ct get: 


ctacct gg cc 


7 b 0 


ttcctLtg^q 


acqtut lEi Z-t C 


tggotacctg 


ct ct qqga-r'g 


■c:cct ggt ct-D 


ctccgactac 


BhO 


ctgatcctac 


tc^-SDQqr-t g 


c ct lagccrc 


tti^ct'jt w-K- 










t CV7qr't r^Lj k 




r.r.r.L.cq.gLrig 


c tct c tgega 


gqac-c ggccg 




qgnr3! : lMl;f,t:ci 


tiC CCS iTt Cj3 




cagct =tgatt 


otgagggtcc 


aactctgcca 


10 20 


q<j l^CC:f.'S tgq 


ciq d >~^q ecc =; 


gtcaeagatg 


qa t cc-t gt gg 


ccc-^ncct 


ggt q.TJET'L: V-U-C- 


1 ryt-:l.] 










$ 1 
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acastccagc c^cqiitL^n^ tccwc^gct ca^c^caQC tgaaccctac qgcccsgcca 
,:'itjr.L::o. :3 nt.-:- r^f-agco^ ^CS^^CU ^ frCfctC Li tyg cccayi^ava qtcarjthttct 
.ytiqqccci-iQ- L^CM^qcaL,^ cactsacqtc cagaccccl.q i^ocCiJC t^i: C 

■^2 1 0 > 

<2LL> 417 

<212> PKT 

--21 H, Sapiens 

< 4 o d > £9 

ThL Thr Mel- GJ.u AJ.a L* 1 - 1 r > L V * 1 Tbx C51 y Hl=i ^ rr7 prc> Arg Thr 

j £ 10 1^ 

Glu Le<i Asp Elu Asp 3er Tyr Pro Gin Giy GJy Tip A?p Thr V>1 

Ph* L^-J Val Ala Leu Leu Leu Le'j Gly Le-u tio Ala Aen Gly L*u MeL 

Alu Trp Leu Ala Cly Ser Gin Ala Arg Hit Gly Ala Gly Thr Are ^ 
50 55 60 

Me L-au Lfu Lftvj Lt=o Sex Leu Ala Leu Asp Phe T.^u Ph« Uju Ala 

SB -70 ?5 

Ala Ala Ala L'he Gin He Leu Glu lie ftr S Hi a Gly Gly Hie Trp Pro 

*0 55 

Leu Gly Thr Ala Ala Cy* firq Phe Tyr Tyr Phs Leu Trp Cly Val Ser 

•p'r Jler Gly l^sj Phe Leu L*u Ala Ala L-e i.i Sex- Leu iV^i • Br? Cys 

J 115 120 i££ 

Leu L-eu Ala lbu Cy* Pro His Trp Tyr Pro Gly Kie Arg Pro V^l Arg 
130 13& 140 

L*n Fm L^l> Trp V E il Cys Ala Gly Val Trp VaL Leu Ala Thr Leu Phe 
1^5 150 1*5 160 

q. T p-^ Trp j , u , , Vfil Ph c Pro GLu Ala Ala Val Trp Trp Tyr Asp 

jfr..s 170 175 

■> ftu us r vl > Leu Asp Phe Trp ftsp ^-i Ply G.l.u Suur Leu Arg 

Ififj lfiS 190 

Met Leu Glu Val L*u Gly Gly yhe Leu rr-a Phe Leu Leu Leu Leu v*l 
195- 200 205 

r^v- Kir. V&'j L--u. Thr Gin Ala Thr Cys Ri-q Thr Cys Hi.? Arg Cln 



2:1L1 



1200 
1251 



Gi„ fi ±n Pxl. Ala Ala Cys Arg Gly Phe Ala Arg val Ala Art, Thr lift 



?age 4 2 
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Leu Eftrr- Aid 


Tyr 


val 

2 4 5 




Li 


Axg 


L*U 


Pro 
250 


Tyr 


Gin I.*U -^J-i 


G 1 r > 


u. 




Lej Tyr Lqu 


Ala 


Ph.* 




Tip 


Asp 


Val 
265 


Tyx 


Ser 


Gly Tyr L* ls 


L*li 


Txp 




Glu Al* Lt«iJ 


V=tl 


Tyr 


S«^r 


Asp 


Tyr 

SCO 


Leu 


3 le 


Ley 


Leu Aj?ti Ser 
2E5 


Cya 


L-e li 




Prn FhO 


L-5u 


Cy>; 






Ma 




AU> 




Len Arq Thr 


L«3U 


L*L1 




Aiq Ser Val 
-30S 


L-5YJ 


Set- 


tSer 
310 


Fhe 


flla 


Ala 


Ala 


115 


Cy.s Cli] G 




P ro 




Giy Ser L-he 


Tax 


Pro 


Tnx 


Glu 


i J ro 


£ln 


Thr 

330 


Gin 


Li^-ij Asp Ser 


Glu 

■> ■:; r. 


Gly 




Pre? Thr Leu 


Pro 


Clu 


Pro 


Met 


Ala 


34-5 


Ala 


Gin 


Ser Gin Met 
3-50 


Asp 


Pre- 




L A J. li. <jJ" 


PjNj 


G 1. rl 


V^l 


A An 


Pro 
3b0 


Thr 




G \ ft 


Fid Arq Ser 


Asp 


Pro 




Thr Ala Gin 

3 7 0 




rjln 


Leu 


Asjj 
37 5 


Pro 


Thr 


Ala 


Gin 


Fjtd Git j Eer 

2 BO 


Asp 






Thr Ala Gin 


Pro 


Gin. 


Leu 


Atn 


Ley 


Met 


Al □ 


Gin 
J 95 




Asp 






Vnl J;].,i G!n 


pre 3 


15 J ii 


M * 


Asp 


Thr 




V*l 
0 111 


Gin 


Thr Pre Ala 


ETO 
41-5 


Ala 




Ala 


























<21£» 5 9 
<211> 559 
-^ia> ON A 
< 5 1 H . a p i * r*s- 


























dqoatgctgg get c cat c-ac; 


c a c- c a .=; g a i 


ttiCLil.gtL-:.:-:^ 


I..C i.:t tjtg "t.^ ' 


GO 


catctagtac 


cqct qccacc a< 




eaca 


cci 


:gtca 


gca 


gt e-gt gt qtc 


CLyfrf.r: IT g Q.r-q 


1 


cc"t gt ccct 


get crcaciagc atcctggaat 


ggatgtt 


ctg 


tggct t cctg 


tctagtggtg 


i d O 




ttg'H 


y t y<a 




t c-n tea en 


oc-a t ggctg 


stttttttat 


2 'J 0 




l : t (J t y(] l;Lll a « 










L r i.Li:t:i:.r.i.,ji: 


tK^t, 




3 00 


aqat yiccctt 




g get g t aca t c-ca 


tcctgctcag 


agt get ggtc 


ttil J CttJL'tr:r. 




gt gacct gec 


Dtt: t ggcat 


t cagtgattcc 


tatttttctg 


gat ecaegtg 


gatttgtcac 


4 2o 


gttc^tDtaq 


tttccatttt cctgtc 


-cact 




gtgccaaccc 


cattatttac 


480 


ttcttcjtng 


3otcc 


-ttt? 


q rv 


J^rjct t C 


t n r: .;+■:]■;.] ^ a. q^j 
P^rj:* 


ctotct prjet 


qqtt. 


%t ccag 


5 40 
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^ <)qrt ,- tr?c ttvitLcacqec S.Q&qqtqqz* gycgt^t t t.r- t^cwwiacc 

drqqaucUt r:.it.y£dLjCbg attgsggirra. tqa^gaa^ii ccU^l-qccst gti^LUc^n 

<21B> 70 

^2L1> 213 

<212> FfvT 

'£13- ; - H . Sapiens 

< 4 ft D > j 0 

T-.-'r Prq Pro Glu His Ala Gly L^u His Gin Hit Gin Ala L*u Fx* V*i 

1 J " r, 10 ^ 

Hi«* Pro Val Ala His Leu Val Pro Leu Fro Pro Pro K1h Thr X'ro V3l 
20 25 

q Rr £ P ;r Arg VaL S«r Cys Ser Cly Pro Cys Pro Cys Cys Axq Ala Sex 

Trn Asn Cly Cys 1 Ma Ser Cy* Xl-u Veil Val Leu lie Leu Fbe 

-'- .S5 60 



nl« Vfil LVS HiA Gin lie Sex Sfer Gift Ser Mift Ph* Ph* Tyr V<i 1 

65 J " 70 ^5 HQ 

Txp Phe st.r Ala. Glv Pro Ala Arg Phe Cys Txp S*r Gly tex Phe Val 

35' 90 55 

A^p Pro Hly Aro Cys Prn Pro Civ Cyz, Thr Pre- Ser Cys Ser Glu Cys 
IOC T'* 110 

Trp ^ r Set fJftr val Thr Cys Prn Leu Ala Fh* Sfcr A«p r Tyr 

120 125 



1 IE 



Phe Sex Gly 5er Thx Trp He Cys Ms Val Axq Leu Val 3 err lie Phe 
130 135 HO 

Tc-u s.-r Tin T.nti Ser 5^ fi- 1 " P-^ 11 - T i rr Ph& Fhe Met 

'"' l&rt 15>5 

Glv rex Phe Arcs Gin Leu Gin Aan Arg Lya Thx- Leu Ve i T-ttu Gin 

V " i 65 170 r.'* 

Arq Ala Leu Gin Asp Thr L'io Glu Val Glu Glu Gly Arg Tip Arc, L«ij 
160 1B5 1-^ 

Sp-t Civ Clu Thr Leu Glu L*l> Scr Sor Arq Lea Cly Pro Gly Arg Ala 
19-, 200 

ski- 71. Ls Lftu il*r Val 
:liu 

•- 210 -* 71 

<2il> 5 

<212j UNA 

<213> K. Sapiens 



600 
£5 9 



WO U 1/36473 



pcr/LiSuo/jissj 



<4W» 71 

atqccgaagc 1 c;j.q<Jcoqcaq! psqA^^qa 15 qaqgricqgt g acg-aggatgq rjcccaqggea 60 
gni-ninqgcigt gqqqyc.-'r-grzt q y oci-ij'CO t l g CC-ahc cc*c agcaqcagca t^ogcjcbggc L2u 

gc£-^ac.gtgg cggctgggct ccccq a aga a ctgggtgcag qcQic^g ctya gcagc.^^tq 240 

cagcagcaqqf cagaggqccc .sgntqaqgqc gcacacacac: q tc^qtsaqgt ggcgtqqgc^ ol)u 

q r^qqc^cc!^ g t P.cv-iS pqct □» qq -■tq.tf '""^'^ ^ rjqco-.i^nc-nc t get cea cqe- tcdccrgccqc 3b0 

•laggaoarrtc- £iyc:<2c:f:a^gs r.-gr..iq^racaB g-dagcQic.^y l: qt tyOvPiii :c trpqtctqcac A 20 

gaagL-cc^gg ^acjtcc^gcrc ggectty c= g caa^tcigggq ga t.q^ Lii:.-i cc^ LutqiTca 4E?0 

gcc^aqqaaq 3tc:agatccg cqcaqgccac qtcc^ggsqg tagatgqcua aaqqy t LLcit 
gt aga cat tig- ga get gage 

=:21.j> ~!2- 
PfcT 

<213> J l. Sapiens 
<J0u> 7 2" 

Leu £er Ser Aan Val Tyr Arg Aan L ? ro Phe Al = lie; Tyr lieu Leu 
J b 10 15 

Val AJe;: Cys Ma Asp Lr-L> lie E-'he- L*u Cly Cys His Met Val Ala lie 
2D 30 

Val Pi c> f-^-p L&u Gin Gly Axg Ltf-u A^p Ph« Fro CSly P>lf Vn 1 01 n 

liS ■] 0 " « S 

Th* 5tr l.eu Ala Thr Leu Axg Fhe Phe Cy£ Tyr 11 & Val Lily L*ui iler 
.5 0 5 1. " 60 

Leu Ls7.i> Alvi 7'dii Vfl I Sor V^l Glu G l n Cyn l.r-u Ala Ala Let: Fhe Fro 
S.S 711 7 5 GO 

Ala Tip Tyx- f^r Cys Axg Axg* Fx^: Ax-q His IjkSj Thr Thr Cys Vol Cys 

Als Leu Tbx Txp Ala Leu Cys L*u Leu Leu Hif leu T'hr Thr cys Val 
100 10? 110 

Cys ALa Lbh Thr Trp Ala Cys Lr-u T.-cw I.mv His Leu T.i^v I>>u Sex 

115 l £ H i ^ 5 

Gly ALa. Cys Thr J..£i.> T^u L*i] J^r Rly Ali Cys Thr C?ln Phft Fhft GJ y 
130 13b l^ti 

Glu Pi o r- T'i t c* » ' ft Le ll eye Arg Thr L.t;ij Trp Leu Val Ala Ala Val 
145 " 150 l- 1 - 1 

Faqie 4 3 



woui/M-ra pcr/usuiwis*i 



I.ru T^.n Al« J.i-1-i Lfcy Cv* r;v* Tfsir Mi;. I;. Cy* Ely Ala Sor Lsu Met Leu 
lf;«=. i"7\J 17 5- 

Leu L*=U Aly Va I Glu a Gly Gin Art] Pro Pro Prti Arg Gly Ph* 

LEO 

Pre i j L y kco IIr Leu Leu Thr Val Leu: Leu Ph* Sex £er Ala Ala Cy* 

20O 205 

Arrj Hi 5 
21T> 

<^ll> I DOS 
*212> DMA 
< 2 1 J ** K ■ tapiens 

<400> 73 

atggaatcat ctttctcntt tggagtcatc cttgctgtcc tggcctccct catcattget 

&utv<iv.w.i*v t Litjt- ijLiOlij I- qqc:l , ..:jr.Lc:rl..i 1 t; t ij I t g«t cc; acaag-natga tggtgtcagt 

. r; i. tif.qc: 1 1 c« rrttqa dtct g-grtg t. g oc t g* .-a c c I b<j a I 1 ytg t gr>.:; cat ctgcic 1 E: 0 

ctactcfitig iccsgctcco cagcccttct cgqcccacat igeagacct::!.- qt-qem^ct Q 24 $ 

eg gat §g cat ttgtcacttc ctccqc^ct gcctctgtcc tea egg teat get g M l.e^c 

tttgacag^t accttgccat caagcagccs ttccgctact tg a a gat cat qaQ : t 44 gl tti 

gt gg^eggpg ectgonttge- cygnet gtgg ttogtgtctt acctcattag cttcctccca 

LJ.i-L-vjy^l-^T fcC cJ I .-3 r - I: S qCii^L^X: a*fiijiMC agtgC*gctt ctttgctgta 

tttcaccctc aettegtget gaccctctcc tgcgttqgr:t t:r.t. tcoce^t; ty-^t cc-t c 
tttgtcttct tetactgega catgetcaag attgcctcca tgcacagcera. gcA^l:!'-^;* 
nagatg^ac atgesggage- catg^ctgga ggttatcgat ccccacgqac tcccaqrjc^.;: 
t-tcnaar^ctc tcocit^ctgt gtc-tgtt c=t c attgggagct ttgetctate ctggaccccc 
t: l..;:i:-Ut;i i v i utqy r.;* t r g l.. y-cr/atja l.-jytfu t ncc ;ji.iqiict gtcacctctn cctagtgctg 
gaacggtaec tcjtggctgct eggcytggge aactcccLtc ti^ic^r.*..^: ctst nc~ 
tattggcaga aggaggtgcg actgcagctc taccacatgg ccctaggagt gj 
ctcncctrat tcct rrctott rct^nggec agg-aattgtg gc-ccagagag geecaqggaa 

^,|kl:.r;c:L.gt«:: L':gt lift C t ft|_,- : r. fiC^O K i .: m tj ^ i j I;. L t-U ^1 1 cVjOt 



<9.\ \1> 7 0 

<21i!> FRT 
<2L3> a. Sapiens 



120 



300 
600 



7 6u 
8*0 



9 60 
10D3 
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<4Q0> 74 

Met CI j Ser 3er Pho Sir Ph* Glv V-al lie l&u Ala Val Leu Ala Set 

1 5 10 15 

Lr.u I.lc lie- Alft Tin Thr Leu V<al Ala Val Ala Val Leu Leu Leu 
2 0 30 

lie Hi* Lvs Asn Aep Ely Val Eex L^u Cy* Phr Thr L?-u Asn leu Ala 
3 5 J 5 

Val Ala Asp Thr leu lie. Gly Val Ala lie- Ser Glv L*ll Thr Asp 

5 0 5 5 bD 

Gin Lfti; ^.-i 3r.j. Piro ?r?r Arr? Pro Thr Gin Lys Thr Leu Cya Ser Leo 
£5 7o ' 7 5 BO 

Arq Met Ala Phe Val Thr Ser SEr Ala Ala Ala is^r y^j Leu Thr Val 

BE 9 0 

Met L<au 11* Thr Fhe Asp Arij Tyr Leu Ala lie Lys Gin Pro- Phi* Acp 
LOO L0 5 110 

Tyr T.t?u Ly* Tie Me* It].'/ Phn Val Ala Cly Ala Cys lie Ala GQ y 
115 120 125 

Lftu Tri"> Lrtii V^l ^r Tyr lie; Gly Fh* L^u Fro ley Gly I1b Pre 

130 135 "HO 

Met Fhe Gin Gin Thr Ala Tyr Lya Gly Gin Cys phe; Phc Ala Val 

1-5 5 15 0 i:.£. 1C-D- 

Phe His Pro His Phe Val leu Thr Leu Ser Cys Val Gly Ph* PJi* Prn 

105 170 1?S 

Mft M^K l.r-j l,nu Pne Phi.:: FhT- Tyr Asp M*t Leu Lvs lie Ala 

1^0 195 130 

Ser Met Jlis Eie-r Gin nLn rle Arg Ly.s Met Gly His Ala Gly Ala Met 

19S 2 00 £U5 

Ala Gly Gly Tyr Arg Ser Pro Arg Thr i*ro Ser Asp Fhe Lye? JU;* T.^u 

210 215 220 

r, TKr V.i I 5r*r V.^l. T,<?u lit? (51 v Ser Fho Ala Leu Ser Trp Thr Pre 
2.*fJ 24 0 

Fhe- Leu lie Thr Gly He Val Gin Val Ala eye Gin Gin CyS- 

Tyr Leu Vnl Leu Clu Arg Tyr Leu Trp Leu Leu Gly Va I Gly Asfj ten 

2^0 " 2-65 2 7 0 

Leu- Leu Asn Pro Lou lie Tvr Al-a Tyr Trp Gin Lys Clu Val Arg Leu 

Glrj T..ku Tyr J1i.s H^L Ale Lreu Gly Val Lys Lys Va.1 T*-ii Thr r.rr Pho 

2 9 ft 2 §5 30 0- 
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PCT/USU0/31NH 



Leu L*u Pht L-eu iier Ala Axg Aeu Cys GLy Pro Glu Arq Pro krr? GIu 
305 310 315. 320 

Lie r Ser Cya Hi. 3 lie Val Thr Lit: 5ei S^r Ser Glu Ptvs Asp Gly 
3 25 3 30 33 5 

^£10> 7 5 
<2il> 2137 
<212> OWl 
<;•:]. 3 > H , S>?p.i 0 11 !> 

<4O0> 75 

a act ggaagc; qcagccgt.rt Qcegcccs^cj ^Acscobt ct cnagc-acttt gagtgaccac 

grrcttgcaag ctggtggctg uccccccgaii tt: ccgc-qiit c: t-^yp-icacgg ccgt cgact t 12 U 

^□?cgt tgca tcctgttacc tggaqaccct ctgagctctc acclyCt?»ofc t t-.t gccgc t g ltd 

•:-fr f cil.qi-.orzi tjocrn^crc ^agrsaccccc ccaqgatqcs gqtcccgasc AijtAtiiTtiQ'f* S "3 0 

cg£ : 3caa-r;gc qacgct c^sc- b l.got^iMi/i scccgiogat cgcggtgccc ct^cccgtgg 3('l) 

tgtactcqct ggtc;grggc-a gt cage a tec o-.-ggcftA'-cfr. ut l;i-tct c t g tgggtgctgt l J t"-0 

cjccrjgogc&t gggcioccaga t ccccgt egg tc£.trttc£t liati-.a^i :tq ;.tqi. ipt caogg 41' D 

ac-ctg-atnct oqnco-gcgtg t tercet t tec aaatctacta ccat tc-iaa-. i.i^o^ct 4£5t> 

4<j<jtatfcuny ngtiictnctt t gcLuicg t ng t^cuq tggc cttttacaca aacatgt^L.l 540 

tr-:«yLaL.t:t:r. i :3 L <4 A<: i I. <j I & t r;,aq r:q I •! nagr-gct t CCtggcggtC CtqtdCL'Cg-: t^O 

tcayctccaa gcqctggcgt' cgccgtcgt.t. ai.if^g^tij^O ^rjuqtrtqca gggacctgge 6 60 

tqctgctcct gacD^occtg toeccgctgg cgegcacu^A L. L:tc.ir::;lorj ccggtqcacg 720 

ccctqggcat catcacctgc ttogacgtcc tcaagtggac gatckrr ccc* "^'"i Uj<icca "J BO 

t qtqgno^gt gttc-ctcttc- accatcttca tsctgctgtt cctcatccctf r.t<:ylLv?fcvo> £40 

C»:ql; i^cUa t tt»c-?i?< iqtx; at'catcct .'a «&gct c?t tgcg cacgqaggag gL-cjcacgg^r. ?0O 

ggqagi^aijLiy qoggLL^.^i^y ij.Kyy.yi":i^l.. i«j L.i.-qoeintqgt cttrictggct tttqtcscct 'Jbt: 

gcttegecec caacaacttc gtqctcctqy cgrac:3 1 0-q h q^iH-^qcctg ttctacggca 1020 

agagctacta ccacgtgtac aaqctcacqc tgtgtct-iag ctgi^LLia.-ai: ^-jnt^tctgn IOB0 

a r-rcq t- L l:q r : 1 1 c tt act 1 1 gcqtcocng.^ aatt ccagct gcgectgcg q ga at a 1 1 1 gg 1 1 a U 

g^tqCi::Q.*:Cy ggtg<*:f:'.:^i y a y i-lfi i:tgq ata l: Q i':Cv.: i_: ij t^q 3 p>g rjirit C ttctccgoca 120l> 

gga^cacqtc cgtgcgctcc gaggccggtg cgcacrrct.ga ^g£*t-y qq.i cK^'-*Ol * 1260 

ggcccggcct ccagaggcag gaqagtgtgt tctgagtccc ggggg^gcag cttggagagc 

cgqgggcgca gcttggagga tc^aggggcg catgr?og-5igg ccacggtgcc agaggttcag 13Bu 

ggagaacagc trjoqt-tgctc r^o^yijC^ctq L-^qnpoo^GrP gt rji-iggn .^g qtctcc^ggc 14 40 
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tttattcctc 


ccagq t^act g 




qqtgsqrrsag 


qgtctccagg 


cttc^cccag 


1^00 


qgtogagaaa 




cCo""! on c;cq c 


sonqigqt gr-t 


tqttatcctg 


cagagqgtqc 






ct gtg tctiqq 


^Cf.^. ^ij r:t tr.-q 


tq(: O^c^c-q 


cccggct t 


ttttgtattt 


lt.2D 






t C^OC l"Ti':c*;C|.=i 


c;cl k-Ct tti 


c^ctoctcac 


acctgtccat 


163 0 


a cccqac-ga t 


y q a tat tL-L: a a 


crcagcerca:: 


cqcct^cco-^ 


*.i;t.f:rjij I; hl r- 


t qq ;^ l.-^ t r.jct 


1740 


ct gtggc^cqa 


a ct 3c gag l c 


rcattCCC£,q 


ctcttctccc 


tgctqaLit-L 




1000 


aca cct c^t-^c 




at rgat att c 


aaccagcccc 




cgac; t uggr.r. 




t C-t C'lJ^l t rtt C 


ct ot gt c^g gc 


"lei act g cQ!^'i 




agctct cctc 


i~! q t q i — : x. g a a 


1920 


LcLCi^l.l.:^ 




LtjL'Lrct t. l: U" ; 


OALtr.Lit.Oi^r;.:- 


c:«q cjcv:] 1 tct 


gnit c-t ccgt a 


1580 


gcecqqt cica 




tctqt ttat t 




gcjctqt ggt. 


t^Cty Ujq^t 


30 4 0 


ggaat tc ttc 


ttt cagagqn 


qcgcctggi?g 


c t cctgcaag 


tcagctactc 


t ccgtyccca 


2100 


ct tt rj-.r tc-n 


on c.^rj.:? r.:c*c-£ 




i-C!*<3 ttc- 






2 I'll 



^i!0> 6 

< £ i -:> [L . Sapi en t* 
<400> 7 b 



Met Gin Val Etc Asn 
1 5- 



Sex Thr 



Gly Pio Asp Aan Ala Tbr 
10 



Leo Gin Wert 
IS 



Lou Arq Asti Fro M ;.a Tlr 

20 



V;;j1 TMj T.cT' Prn V H 1 Vn.l 



Tyr 5er Leu 

30 



V9 1 Ala 7'.1m Vftl 



1 1 * F*il> 



Glv -Pi-S.fi Lft'U Fl"|t? -S^jy T.^'.j 
4 0" IS 



Tip VaJ. Leu 



Cys Arq Arq Met Gl y 

&o~ 



Krg 
'•5 



3tr Sei Val lle= fhe Met. lit* Asn 



Sor V^l Thr A-.p 



711 



frwt Leu Alo bex Vsl L*:u Pre 



Phe Gin Lit 
E: Ci 



Tyr Tyr Hie Cys Asn 



Pctq Hie Hi 5 Trp V*l PJi** GJ.y V^i 

■so 



Lr:i,i L*ijl Cy^ 



isn Val Vsl Thr Val 
100 



Ala r-h& 



Tyr Ala Asri Met Tyr a&r 
105 



Her r 1 & L*u 
110 



Thr Met Thr Cy* 11* 
lib 



5L"-r Val 



Glu. P-rq Phc Lcn Cly Val Leu Tyr Pro 



Let "i.i :Jt^i- L,yj3 Aiy 



Tip A r y TU" y Aj. y A A\j T i> i A 1 ^ V j 1 
liS 140 

Tage A 9 



BNSUJOiO -.w< 



WOU1/J6473 pC'r/usaoms*i 



Ala Giv I'hr Txp Leu Leu Leu Leu Thr Ala Leu sex Pio Leu Ala Acq 

145 150 15* ieo 

Thr Aap Leu Thr Tyr Fro Val His Ala Leu Gly lit He Thx Cys rhe 

1-55. 170 17 5 

Val I.ru Lys T-™ Thr Wet L*u P™ 3er Val Ala Met Trp Al* Val 

JkO 190 

Ph* Leu Phft Thr- lie- Phfc He Le-g Leu Phe L^i.j 11-:? Pro Phe Val He 

Thx Val Ala Cys Tyr Thr Ala Thr lit; Leu Lye Le-u Leu Arq Th»r Glu 

210 215 220 

Glu Ala Kis Gly Ax? Glu Gin Arg Axg Axg Ala Val Gly Leu Al 



Ala 

J5 2^0 



Val V a J T.^ii Leu ALa Ph.fr Vfll Thx Cys Phe ftla Pro Astj Asn Phe Val 

i^u L*u Ala His JIb Val Ser Axg Leu J'he Tyr Gly Lye Sex- Tyr Tyr 
2 GO 265 270 

Hi-- Vnl Tyr Lve Leu Thr L*u Cys Leu Sex CyB Leu Aan Aan Cya Leu 

A*p Pro Phi** Val Tyr Tyr Phe Ala f;er Ar-q p.l u phe GIn Lou Arg Leu 
290 29E. 300 

Arn Glu Tyr Leu Gly Cya Axg Arq Val Pro Arg Asp Thr Lea Asp TT. r 
305 310 " 315 32U 

^rn Arn Glu Ser Len Phe Sex Ala Arq Thr Thr Sex Val Axg 3er alu 
325 330 335 

Mn riy Ala Kis Pro Clu Gly Met Glu Gly Ala Thx Axg Pro Gly Leu 
34 0 345 350 

Gin ft re Gin Glu flsr V* I Pha 

*210^ 7 7 

<211> 11 £=7 

<2lZ> 2>HA 

<2 1 K ■ c &pi en s 



.^t.qy;igti:;q^ q^c" f L;. l-::^ Q^c-uocq^oq ^tg^qr-rj^qg teat ccitcct qcattacaa 



tacaccggca agetccgegg tgcgcgctac raecogggLg .-i^qociLqi;:^ en'-^qaegee 
gtggtgtgcc tggcggtrgtg cgccttcatc gt get a gaga atetagecyt gttgttg 
oi;onne*i;[]Ci: .ar^oncqctt (.-.w^jctuw al-gttcctgc tect^gqeag cct&acgttg 
tv-gc^rctgt: tggcaggr:gc cgccr.st-gce gfx-aL^t r-o LActcjtcijew ^o^tncicii 
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SO 
120 

gyLy 1U0 



2 1 0 
.300 
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ct'^aaact^t cccccgcgct ctqgtt crca. cgggaqggaq gpqtcttcqt qgcaet cact 

gcnjtccgtnc- tg.^qcctcct g^ccatcigicq ctrrg-agcgcs gcctcaccat qqcgcgcagg d 2u 

tjqr-c^Oqc'-'C- ucqt utccat; t co^^C^O ^C-qCtqqcc 1 ? t grjc-agccgc qqcctgqqgc J 30 

L>t.L:y-t 4>o tc^tCQ^el. er.L.^c.-iy t-^ t: r r q gt>j t go^. a tt q-7-c:t cjgq t'T-gcT-tcjqac 5 4 0 

gcttgctci'.'A Lrtqt izttgcc gctctec-gcc a^ggceLaiTD L^eKci. tot j Cst^Otuycc 6 DO 

ttcgtgqgca t ectigqcegc tatctqtgca ctctacqcgc qrat ct act-i c^a^tii^ 15 

gccaacgcgc g^cgDctqcc ggca egg ccc ggqactqegg gqaccicctc g^crcc^^gcrg ''^u 

cgfccqcaagc crcgctcgct ggccttgctg cgcacqctca gcgtcigtgct cctgqccttt 7tH.' 

r-it^y^jtqtt <jrri?<]0-ec^ct ■': 1 t-Oot t^-t Q Ot qt t net t-g c+c-gt rig cgt q cccggcgcgc 84 0 

i±r:r t.qt i;. ;Lp t. r.-r.^ LrLOCd qgi;C^ atr friT: iitcc t Ljq^ m f- t <)■ J or^ t gnic ojta ct cn ct t 5 D 0 

ctga^cccca r c atct aca c gctcdccaac cqcgacctgr cjccaccve get. irt.U-gciqi^i.ij 960 

gtctqc tqcg gacgccactc ctgeggcaga qacccgagtg gctcccagca gtcgcjcga.gr 10.^1/ 

gcqge-t gagg cfctctvggsggg cctgcgccgc tpcctgcccc egggecttga t^ggaqcttc 106 0 

nnci.Tri'.-tcqg ^qvijctu^t-i; tvirrv^ncqc- qp egg get qg ncaccagcgg ctccacaggc i 1 J Li 

agnize.: yuit g o ilccscicil' cqc-ccgg^c t ctggtAt.ta^ a&cc-ijL'-:: r (jn a».T«3ot-^a 11 3 7 

<-2L0> 7E: 
^2L1> 3?B 
<Z12> PRT 
<2 1 3> K . 5ci[.ii pns 

^un> 

Mm L G±K Gly Lfrli Lftru Alp Plrt 7*1 0 "Pro V*J, 5*1- GLk' Vnl Tie VwiL 

1 5 10 15 

Leu Hie Tvr Aan Tyr Thr Gly iys L*u Arq Gly Ala Arc, Tyr Girt Pro 

10 2 5 30 

Gly Ala Glv Lea Arg Ala Asp Ala Val Val Oys Leo Ala Val Cys Ala 

T'lie IJ- v&l Lr-u filu. Mn T..7U ft! a Vnl Ltd Lpu Val L«u GLy Arg His 

50 t>n 

Pro Arq The Hi a Ala Fi-:. M-at Phft Leu Leu 1.&L) Gly J>.:\j Thr Lnu 

t.:-. " 7 0 7S HP 

Ser Ai=p Leu Leu Ala Gly ALa Ala Tyr Ala Ala Aan lit: ]_-eu a&r 

Civ Pro Thr L<-'J T.vJ L/?u SH-r pro Aloi Leu T'rp Phc Aln Arc? Glu 

luo lo.s i. in 

Lage 5>1 



F4NSU"x:iD • w: 



WOO 1/36473 ^ 



Gly El" Val Phe Val Ala Leu Thr Ala Ser Val L*u 3<sr Leu L*u A\« 
J LI 5. 12Cl 125 

lie Md Leu Glu Arq Ser Leu Thr Wst Ala Arg Axg Gly Pre Ala Ho 
130 " 135 HO 

Vnl Ser Ser Arc! Gly Arq Thr Leu ftln Wet Ala Ala Ala Ala Trp GLy 

1^ " 150 15* 160 

val Per Leu Leu T.eu Gly Leu Fro ALrt L*u GLy Trp Asn Cys Leu 

IsS.S J'O 17 "' 

Gly Ar 9 Lftii A ftp Ala Cys Ser Thx- Val Uu Pxo Leu Tyx a la Lys *lu 
ISO LBS - * 

Tyr val L-eu Fhe Cvs Val Leo Ala Ph* Val Gly He Leu Ala Ms 11* 
19E ' 200 20* 

rvs Ala Leu Tyr Ala Arc- lie 'Tyr Cys Cln Val Arg Ala Asn Ala Arg 

" J ' *™ 220 

P.xg Lfey Pre Ma Ary Fro Gly Thr hl» Gly Thr Th* r^x; Thr Ary Ala 
:>.^s 230 

Arg Arq Lye Fx* ^ Ser ^ L * u Leu Ar ' 9 Thr lje ' J S " V * L 

" 2 in- 260 

Leia Leu Ala Phe Val Ala Cys Trp Gly Pro Leu Phe Leo leu Leo Leu 
2&0 2t;c ' " 70 

T.-u -W> V*l Mu Cvft Pt» MA Ary Thx Cys Pr*> Val. Lftu > n Ala 

2?B ' 230 2aS 

P.so i*ro The L*u Gly Leu Ala Met Ala Asn 3ax Leu Leu Aan Dro He 

lie r«r Thx Leu Thr Asn Arq Asp Leu Arci Bis Ala Leu Leu Arg Leu 
30 E- J 310 ^ ^ U 

Val Cys Cyy Gly ^ flj.y S^r CyS Gl/ Prt - J G1 '■'* Scr Glu 



."3 13 S 



fi in s er M.=. flftr Ale Ma Glu Ala Sex Gly Gly Leu Arq Arc; cya Lew 

340 3 --15 350 

Pre Fro Gly L-eu Aap GLy Ser Phe Sex Gly Ser Glu Arg Ser Ser Pre 

Gin Arq Asp Gly L*m As P Thr 5*r Gly Sex Thr Gly Ser Pro Gly Ala 

Pre- Thr F\]» Ma JVr-n Thr Ln-u val Sex Glu Fro Ma Ala Asp 
1 30.0 3'J5 



<211> 10J1 

<212> UNA 
<213> K . Sapiens 
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- V 1 Jl . Ilfl^i.ftjiS 

■;$>jos< eo 

M*t 'Jyr Asn Uly Ser Cys Gya ?S j: g He Glu Gly Asp Thr He £er Gin 

Val Met Pro i'r- Leu Leu He Val Ala Phe Val Leu Cly Ala Leu Gly 

20 25 JO 

Asn '^iy Val Aln Lpu Cys Gly Phc CyS Fhe Hi.? Mo I Lys T>ir Tn> Lyr 

Pt-..i S-f.-.r Thr v-'dl Tyr Leu Phe Aen Ley Als Val nla A£=p Pht L*a L*u 
F>u 60 

Met He L'ya L*^ fro Phe Arn Thr Asp Tyx Tyi Leii hrrj Arrj Arrj .Hi^ 
t.S 70 ^ 7.^ KM 
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Trp Ala Phe Sly Asp lie Pro Cys Arg Val Gly beu the Thr Leu Ala 

pc 9C: r 1 -! 

M-t Arp Al* Glv 31ft v*j. Phr T,ei: Thr Val Val Ala Ala Asp 

" 100 ids no 

Aiu Tv i- Pile Lys Val Vfll His Pro Mis Hi* *U V.il Asn Thir l ie Scr 
Hi, 120 Li. 1 ? 

Thr Axo val Ala Ala Gly He val Cys Thr L*u Txp Ma Leu VaI lift 
130 135 H'j 

Leu Glvr Thr Val Tyx Leu Leu Leu -31 u= Asn Hi 9 Leu Cys Val £ln «lu 
145 * 150 155 il3 ' J 

Thr JUo Val Cy-r- CJa> Skv Phfc U* M*?fc Glu S-=r A Li'. Acc. Gly Trp 

lb? 1 7 ^ 

H_i- P-n 11* M^t Ph* Gin Lttu Clu Fhft Phe Met Pro L^u (3 1 y 31ft Tie 

" ieo ^ 

Leu Fh^ cya Ser Pbe Lyu lie Val Tx P Sex Leu Arg Arg Arg Gin Gin 
ic,^ 200 2DC 

L*u Ala Arg Gin Ala Am Met Lys Lya Ala Thr Arg Ph* lie Met Val 
210 " ^0 

v-l AUi Tit* V»l Pb* Ilfr Thr cv& Tyr L*u Pro fftr Val S*r? Al* At-^ 
... .c £30 235 

L BU Tyx Fhe Leu Trp- Thr Val Pro Ser Sex Ala Cys Rsp Pxo S&r val 

^-^ 

uia 151 v Ala Leu lie Thr Leu Ser Phe Thx Tyx Met Asn £er Met 

2&0 265 270 

L-u Arp Pre- L~u v^]. Tyx Tyx: Ph* Sftr Pro -ft* Ph* T,y$ Ptift 

^ 2 GO 2fi5 

Tvr ftsn Lye Leu Lys 3le Cys S&r Leu Lye Pro Lys Gin no Gly his 
i^x Lys Thr Gin Arq Pro Gii: Cla M*t Pro lie Sex Asn L*u Cly Arg 



305 



At"Q Ser Cys He Ser V«l Ala A 5 n -S-ir Phc Gin Scr ni.n hvp Civ 

Gin Trv- iV'p Pet- H .i 31* V«M Glu Txp Hie 

at 

-211> 

<212> L>NA 

't 2 1 3> H . S ap i ens 

^£»t£o aggtgacttc K M-]Ut^ ^.ggcrcac^ tacctccage aattcctcet 60 
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<] ccv-s t cot a t a ryt q a rr:# ucc C-^ ft 1 a a t c-n ; j t c t a& 1 t ca t ag i get r_ a 1 1 ggcggg- 1 2 0 

tiQL.t y LiLjt gl a tt* ccfl 1 1 crtttt Czct-zc r.<fgtqa t ga^ ccj r-c egg tcagt g j cca 1 B 0 

ccatgcicggt cattaacttg qtygtggtcc soagcgtttt Lntgcttf*^ et-OCtc*tttc 24 0 

gcttgaccta cctcatcaag dd^aCttnqd tgtttggqct geccttttgc a AsLtT..jr.:L:!t« 300 

qtn^nritgrt gcacatcwac atrtacctc-n cgttcDtstt ctatgtg«3tg atcctggtct :>h0 

.i-ilsda t^ot t'r.ri UcL . I: h . *:c .:i a y 1/ y i . a « ^ v: t> n .agtcgg n -at t ct a caga aaactgcatg 

ctgtqgct^c cagtgctggc aty tggacg-c- tgijtga ttgt r:« t tgUsjqt^ ^ucctcigttg 4Su 

tctcccqgtfi tqgaatcc-at qagg^dCacE, atgaggagca ctgttLtod,=] Utcflt:^ap.n 5-4 0 

aqLttgctta ca cat at gig sadatratca act a tat gat sgtcattttt gtc stages f >OD 

ttgctgtgat tcfccjttggtc tt^caggtct teat cat tat gttgatggtg cao^agct an H- tM 1 

c70r.nct ct 1 t actzvt ccc-iic c-sgg-sgt t ct gngctcagic-t gaa a aacct a ttttttatao 7LU 

i><}ntc.i*tcot tyt t i: -:j t lit c:i-.t crx iac<: it i: t.ctt t-ag n^tct^ttac ttgaatgttc, 7 3*J 

t yjicgc ;att r: caa t.grrrtg t aatriCjcaticiC' ttgriHiLttt.ii K^r-cv^ t e ttcttgagtc; EMO 

taac£.qcaat tagc-tqctst gstttgcttc cctt tgtctt tgggggaiiqi: csttggtttn 9 00 

ageaaaaqat aattggctta tgqaattgtg ttttgtgccg ttagccac-aa afctaeacrr.^t 

tc^to.tttqc ttocttfcata ttggg^tft nci -in^tgcrgtat aggqgaggta agaatggt^f: loi'n 

t t C-f I L ii ct t g « I " ^ -f V -i {-'-I- S X-nCr- I \->-l Si: -3 i nt.:,;*^ li.s crg.-T--^ P.rjn^ c t at a asa t gca 1 0 E: 0 

agago: ct-ia ttgtaqtcct tatgi^qatcc C t cccatc tc tgaijt^ALtjq coij t-a^Js^an 1 L30 

accaqtgttc; ttgaatccac ctqqagttgc aatattscat tattttcc-ag tacagddtgt. "I.'*r>n 

ct q-t nt gqcc cat g^incjca rrc^t ncqttt tn^gqe-tttt agaqtttcat tagctcattc 

t.?r»j I. ti^rjt r: t Q I ' I;. O h y Lp ■: : A t-'_l- J I t: Ik : 1 1 ^ y y 1 1 + t' j «j> Ot ?> L-OiKT C'Cft 1 1 a«?tt I? 20 

i:tt. 1 1. C a I. ■:. l: f: r.tc a cct t ag^t&ag t .=1 s a. L L n-t.g^j fif: a cca out;:*^ l: r,o 1 «a g^cii 130 0 

fj^rduut c tec: ttcgctadCC aggttagatQj tcccatti'at ctcat^ccct g^ ta^.? .=!.■?<•:: L IJii' 

gataagggg-a gagaatagtt aaaaattttt -stagggt.atc ataactctqg taggaag r..:* l.Sim 

tetgtctaga aatcaagags a^!^ acsacgt gtggcctoct gttataacaa gggtttctag 15 60 

atttntcctg t ganasggtoy tttiianyoct t gnggn.t ca.s r^ttcctc^dt tatcacci-at 1G2:0 

t Ly-ncty i L ij i^lr-riLi^rt^ja^t i-.aMT. r..^aA^c. ctt.SKtg^ac at.at.L-:t js.2^r. ^liLi^t^j.'i le.H" 

r?tqt.:.i^ttl y <jagactat.cc ctgactaatg tyictgqtagg cattaaaatcj agttcccaai^ 

^t>:jci.ntgaL r. .iaaatttttt tctcttctyt tttttgagag aatttctaga tgtcctgggc IB 00 
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li£l<- a C : t t.aa t 
t tCCCiCCCitt 

gtgcagaatg 
gtcaacccst 
tggacaggcc 

tc>:X:hCttCL. 

gaati^at^gt 
gc<?t*j-*«g&3 

•r. aCit gt a cc 
tcttt 



tasgattttt 
aaaattatst 
tgcn^gttt g 
ctacattagg 
ccattgt gtg 

ttceaggtta 
tt gagtaagta 

gutt tt^ty.^ 
t ccr.rr.AaflA 
aLctgtacaa 
cctgaactta 



acsgggggaca 
a tst a tat at 
ttacataggt 
tatttctcct 

□ t t CCC-r:t 

aaattatita 

tatagagiaaa 

St h 1. Lj.r-utJij 

aaata a aatt 



gaaagt tata 
ttaaattata 
sita&acgt gc 
aat gctctcc 
occt cit C-C 
tggr.tLLo.ts 
tttttaaata 
ccnttaattt 

3taactastt|' 
tgacacatgt 
tsaagtataa 



ctcaaatctt 
ccttaagttc 
catggtagtt 
ct cccct age 
at^tgttttc 
I- t^.-.i .q hf?tt 
aatqaaaact 
agactctgtg 

U*tf fit.qtOC'J 
fj-ja.i^t. ,igi ; :C 

taataaaata 



tagamet <::■:-•£ 
tggggtacat 
t goggcacct 

GCCCCa DCC" 

attgt tease 
DtjLt tgctrjrj 
gtgttttt as 
agattaggtt 
rjc^arrqoont 
I t.^l'-a-. t r "g 

atatggatt.t 
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<210> 92 
<21^ 312: 
< 2 1 2 '•- PRT 

Met Thr Gl" A£f> rhe l J -fc- iex Wet 1'ro tly Hi a Aan Thr Sex Arg Aen 
1 J 5 10 15 

;ys A&p P*o 31fl Val Thr Pro his Leu lie o-jr Lou Tyr Fb-s 



iSer Ser Oys rfsp 
2 0 



7 .S 



J!l* Yd T-uj ^.^r. filv RLy Lfi] VaI Gly Val 31* Stt He Leu Fb* L^u 

Leu Val Lys M^t Asn Thr Arg Sex Val Thx Thr Met ALa Val 31* Asn 

r,o 55 t.n 

L*u VaL Val Val Bis S«r Val P)i« Lev LOu Tl": ^1 Pi r> PT.e *r*j T.-i... 

VI) Vb r " :|J 

Thr- Tyr Lr-u J it Lve Thr Trp Met Fh* Gly Leu Fro Phe Cya L.ya 

S Ei 50 - - 

Pbe val Str Ala Wet Leu >Jl3 He Hia Met Tyr Leu Thr the L^u. Phe 
100 It' 5 11C " 

Tvx val Val lie Leu Val Thr Am Tyr T,-.> lie Pb* Phe Ly* Cy* Ly* 
115 12D 

Asp Lys Val CU> Pht; T'yr Arv- L yft Lftii Hi a Ala Val Ala Ala a*r Ala 
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Gly M»t Trp Thi" Leu v«l lie Val He "al vai fit- Leu Val Val Set 
L4f. ISO 1SS 160 

Am Tyr Gly He His Glu Glu Tyr Aan klu 61 u Hit Cvs k : he lys 1 3 1ie 
165 17D 115 

Hi^ Lys Glu Leu Ala Tyr Thr Tyr Val Lys He lie Asn T/yr Hut lie 
1B0 ' 155 130 

Val II© Fh« Val lie SO. a V;il Ala Vril Tie Leu Lft«.j Vrtl Phft GJji '"'all 
195 20 U 2 OS 

Phe Iiri lie net Leu Met Val Gin Lys Leu Arg Hiz Ser Leu leu 3er 

210 215 £2-0 

hit Gin Glu Phe Txp Ma Gin Leu Ly5 Asn Leu Ph* Fhe He 'Sly Val 
225 230 235 240 

lie. Lej Val Cys Phe Let; Pro Tyr Gin Ffri Phc Aim He T'vt- Tyr L«;u 
24 5 2e.R. 

Asn Vrti '.;.:■> 1 Thr His Ser Aan Ala l2ya k&ti Ser Ly£ Val Aid Phe Tyr 

2SD 270 

Asr] Glu lie rhe Leu Sti Val 'rhx Ala He Ser Cys Tyr Asp Leu Leu 

275 28 D 285- 

Leu l?be val Pbe £ly Gly Ser Kia Trp Pb* Lys Gin Lys lie lie Cly 
2 30 255 3 00 

Leu Ji'rp kBTi Cys Val L*>u Cyr. Arq 
?0S" .310 

-21 0 > H ^ 

<212> DNA 

<4Q0> B3 

gcaggagcac fcgsaaatenri guii ca.es tt^t Qt&l.tttttrj t-aatrvTui ojiinqnen?^ 

set to t coat n t g t a t a ■> Cmx] r. L. a r.sj i-at-: a a 1 1 € e L t.gc K- O-j I tfq i:t q X-''j 1 M ■ ) 

canct ntqct tfCtqcg.-trtnc*!. yat: tL>gqt.::c t.gtc<tqaaaa t cccctt':t c gc^ggqatcc itiO 

c:ggg'; g.-att c tc,tacatagt gtttggcttt ggggctgtgc t^ctgtgtt tggaaacctc 2 AO 

ct gg t gs t gs 1 1 1 caa t cct cca 1 1 1 cofti g get q eact ^ t coq anca^ L 1 1 1 ct ■ -q- 1 1 3 C"~J 

gcctct'7-tnq cctgcgotrjci t.l..Lt.t--Lgql.y gijtoL.jtictg tcit^ccctt ^agcatggtc itu 

aggaccs^t rjq ^Lvi^ctqct g gtattttggc;- agg^gttttt gticnttcca cacctgctgt 4 20 

gpLqty^LJAL t.t.tgttactc ttctctcttt cacttgtgct t cat ct cent cqaoagqt ac 3 SO 

OtLijcqqttri ct.gad-ecct ggtctatcct accaagttca caqtatctgt gtc-n^qn^tt 5 4'] 
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<210:= 

■: 2 1 1 - 54? 
<E12> PP-T 

vdOO> tfJ 

M ^ Ser Abu 3Br Sex Leu Leu Val Ala Val Gin Leu Oya Tyr Ala 

1 " & ifJ 15 

Aen val Aan Glv Ser Cys Val Lya lie Pro Phe Ser Pro Gly 3ex Arg 
*7i"| 2* ^'^ 

Val I Lb Leu Tyr Tic V<..1 Ph* Ely Ph* Ala Vol i^v Rl« '■>>) Fh* 

35 

Ri..- l.ftu val Mftt lie Eer lie Leu flie Phe JLys Gin Leu Hi* 

' h.n 55 60 

SeI Pi-c Thr Aen I'he Leu *al Ala Ser Leu ALa Cys Ala Asp the Leu 

Gly Val Thx Val Met Pro Phe Ser Met Val Arc- Thr Val Gliu Ser 

Cy* Trr> Tv- Ph* Gly Arg Ser Fiie Cya Thr Phe Hi* Thr cys Cy S Asp 



10D 



val Ala Phe Cys Tyr sex Sex Leu Phe His Leu Cys ?h* II* Ser He 



115 



Aap Axe Tyr He Ala V«L Ti;r ^Sp Pr* ^ V*.l. Tyr Pro Thx Ly e P h< 



L30 



Thr Val Ser Val S*C GLy Cys He £*r Val £er Trp He Leu Pro 

L-u Met Tyr Gly Ala val The Tyx Thx Gly Val Tyr Asp Asp Gly 

Pane 5* 



600 

6C-0 



tgcatcagcg tgt .;ct gg at c^grwolo st.gtaca,j 0 g ? tgc=tgtgtt ctacaca^t 
gtctatsacg otqyg^gy* <3gaattat.1L c-atyxxcl.^ aetr<t*L*gg aggr^l,;** 
accyttctaa atc^-c^ ggtgttg^a *attt Lctar, ectt^ttar, eccL^ott L 
at^L^Uin Uct^tatg^ taacatattt cttgtggrta ga^gac*^ gaaaaaqat* 
gaaaata-ctg gtagcaagac agaatcatcc tcagagagtt acaaagecag agtggccagg 
agagagagaa aagcagctaa aaccctgggg ?tcac a gt 5 g fcagcatttat gatttcatgg 
ttaccatata gcattgattc att^tt^t ijcctttat-iG *ictttataac ccc-tgcctgt 
,ttt,itn.q, ttt^cr.&Utg gL*Lg.:ttal. taLaactcflcj eQatgaai,^ t M.^fc fctat 1020 
gctttatttt asccatggtt taggaaacjea ataaaagtta ttgtaa^tgg tcaggtttt 
sagaacagtt cageaeccat gaatttgttt tctgaacata tataa 
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16^ 1?[> 175 

T.r.u Glu GLu T -^u Set Asp Aid. Leu Aen cya rle Gly Gly Cys Gin Thr 
ISO 165 190 

VaI VaJ. Mr\ ^:lrj Asn Trp Val Leu Thr Asp Fhe Leu Ser Phe fc*he lie 
19^ 200 205 

Pre Thr Phe rle< net lie lie Leu Tyr Gly Asm 11© L-foe Leu Val Ala 
210 215 220 

Arg Arg Gin Al* Lys Ly-i 01 u fl?n Tine Gly Scr l.y* Thr GU? Sor 
22:5 '-^U 

R^r Rftr GLu fi^r Tyr Lys Ala Arg Val Ala Arg Arg Qlu Arq Lys Ala 

2d 5 250 251- 

Al£ Lyjj Thr Leu Gly Val Thr Val Val Ala Phe. Met lie Ser Trp Leu 
260 2*5 270 

Pre Tyr Ser lie Asp S&r Leu He Asp A- La Ph-M Met G] y Phe lie Thr 
£75 28 U 2 0 5 

Pro Ala Cys lie Tyr Glu lie. irye eye Trp Cys Ala Tyr Tyr A^.r* Sen 

2 4 0 2&S 3 00 

Ala Met Afn Fro Leu He Tyr Ala Leu Phe Tyr Pro Trp Phe Arc Lye 
305 310 31 5 32 0 

Ala He Lyra Val He Val Thr Gly Gin Val Ly Am Sex S*r Ala 

-Thr Mot Asn Leu fhe S-fcr Glu His He 

ins h 
■^li^ ia:u 

v213> H . Sapiens 

accatgaatg agccactaca ctattt^gca .a at nict tut -j atttcu^Ovjii tt-atcioagct 60 

qcttttgqa* at tries ct 7a tQ^aaffStftr: OC-SnJtOMaij.i tQOaetaeol-. cscct ijtl-^t-t- 120 

tatfirjc*Li.o tot trj^N^cit. gggstttcca ggr.-aitigcag tagtgatatc cicttacatt 1B0 

L + .r: a .=!.=> at. gan-ctt ggaa gaqcagc^cc at cat tat gc tqaacctgqc ctgeacagat 2 ; j0 

etg^tgtatc tgacoagect ccctttcctg *tt cectac-t ntgccrirjtqg ognaaact^g 300 

atctttqgnq iitttcatqt-g t.v;nj I: I..I:.,hF_l; cqcitl.^fltjcL to rat t tea a t cjtaLagc 

□ cicjitcotot L.cOrtt-jJ^^t.g tttcagcatc ttccgctact qtgtgatcat tcacccaatg 4 2D 

rirct .rjut r- 1 - « f:cattcacaa asctcgatgt gcaqttgtag cctgt^ctgt qgt gtgqitft n 4 SO 

a II tosctq-g tacjctcitcat tccnstcracc ttcttt.Tci.toci vj^tc^acoaii Otf bqAQ-.sii i% b-d LI 

P a q e 
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J! is Lys rhr Arq Cys Ala Val Val ALa Cys Ala Val Val Txp lie Lie 
145 " 150 155 

tier Leu Val Ala Val Ho Fio H<st Thr Fhe II- Thr Thr 

Arq Thr ash Arn J3*r *le Cys L*u Asp Leu Thr Ser i3er Aap Glu leu 
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72 0 
7 3D 

200 

1020 



•Wtoagcct qlOt^M^t o^C^ttcq gatganctca atactattna gtggtacaac 60 0 
OLyfrttttya ctgC^U** t tt i^^coto cocttgntga tagtgacact ttcjctataco 66Q 
a>, H,=itt.^tn- ^-^t:t ttirfrC uC4tQig^c:t9 t^aot gctgccttaa gcagaaagcs 

cga&g««ta* cc*ttc<^et acncrcttgta r.U- t^ut^t gttttttacc cttccatatc 
ttsjagggtcs ttcaggatcq aatrtcagcc tgctttcaat i^yLtOrttc-:: attCfdfjaatc 
agatccatga agcttacatc gtttctagac cattatgctg ctctqaacac ctttg^l:ai»: 
ctgttactat ntgtcgtggt cag-7gacaac tttcagcagg ctgtctgctc aacagtgaga 
t.i1':;d**ritva yy>wt;x*r.r-t t»i^t-»*9 r '* a&gaaaatta gttactcaaa caacccttga 

<11Q> 8 6 

<1L1 > 336 

<i'L2> FFT 

<213> H . Sapiens 

<4 00> se 

MM A-r. ^Ju Pr-o Leu Asp Tyr Leu Ala Asr: jvlq S^r ftf.p Plus Pro ftyp 
1 "" S I- 1 -' ] ^ 

T wr Ala Ala Ala Fhe Gly Asr, Cye Thr Aap Glu Aan. Il& Pr* Lfcu l.ys 

MBt Kia Tyr Leu Pro Val lie Tyx tly He lie Phe Leu Vsl Gly Phe 
hr 40 4-' 

Pro Glv Asn Ala Val Val Ik S*x Thr Tyr He Phe Lya Met Arg Pro 

?-0 5-5 60 

Tru lv- Tr-r Fi^ Thr" lift lie HeL L«U R?ri LftlJ M$ Cys TJir A^V I/vU 

*s " ^ -jo" 

Leu Tyr Leu Thr Ear Leu Pro Fhe Leu lie wis Tyr Tyr Ala ser Gly 

S 5 go 

ulu Asn Trp He Fhe Gly Asp Fhe Wet tys Lye Phe Ilfi Arg Fhe Set 
100 105 110 

Fhe His Phe Asr. Leu Tyx Sex Sox II* L*u Phe Lsu Thr Cys Phe Sfcr 
115 li n 

ITr Ph«* Arg Tyr Cys Vil He 3i* Hi* Pro M*t Sir CyR Fh* Ssr 
1J0 " 1 3 Ei 11^ 
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Aflri Thr 11 tt I^e Trp Tyr Asn Lku ILl,: Leu Thr Ala Ser T'nr Fhe Uyt? 

L*u Fro Leu Val lie Val Thr Leu Cys TVi Tin Thr Iir, ri^ His Tbr 

2 ill 215 ±2il 

Leu Thi Hlb £iy Lbj Gin T'hr Asp Ser Cy3 Leu Lye Gin Lys A.l Ari] 
225 2-3 0 23b ->AO 

7~'t L^ru Thr EI* Leu Lt;m Lpu Leu Ala Phe Tyr Val Cya L'he Leu Pro 
£■9 5 250 2^5 

Ph* Hly 1 t^i.j All; V£ 1 T 3. r- Gin Asp firq lie Sex Ala Uys the Gin 
2:tyl> Ztt 27 0 

£er V/aJ. Val Pro LkD Arg lit: Arg WkI: Ly*t htxv Thr Ser Phe Leu 

275- 2fi0 2£5 

Asp His Tyr Ala Aia Leu Aan Thr pht Gly Asn i&Li IL^u Leu Tyr V^l 

V«J. Val Err R.$r> ftsn Php Gin Gin Ala Val Cys Ser "Thr Val Arg Cya 
2u>. 315 32 ■> 

"Lye Val Cer Gly Asn L-eo Glu Gin Ala l,y$ Lys lit* Sr:r Tyr £«:-r Aan 

32 5 21 30 " 3 3 5 

< 2 1 D > B 7 

<2ii> use 

• 212> DHA 

dddsattget gtdctqaact attgaatgqa actt go.^. 1 ^ t A^acir.riK^ht oe.?aaatanc 60 

tattcttcaa c-a ga q agt as tsggtaaatg tt tt a^asgt gagagijact ci aA£L<:@f:<:ati 120 

tgatttactc ttttattttt cctcctaggt ttctgggata aqtatq tg.:va aataa^^fil. 1£0 

.aaacdt gaga aqgaact g t s acctga ttat ggatttcjqqa aaaagatasa t caaracaca ^JH 

aagggaaaag to a act 3a tt gacagccctc aggaatgatg cccttttgcc acaatataat. 3011 

tantatttcc tg tgtTTBL-aii * iTt.-^nct err? to aaat gat gt c cgtccttccc tgtscagttt 3 GO 

.10 I, Lj^ ':■ ftr: L 1 : ft 1, a * L. r.. c Lq a f: cft^ti L Licit. 1: c-o-::n a L <: at^ pt L y 1 1 u t tt ct □ t at c 4 2 0 

acacttcasa caacttcata ccccaacaas ttggctcatt cattccatgg ccactg Ly^.i fl&ll 

ctttcttctg gggtgtctgg teatgectta co.gtat crgtq agatctgetg agcactgttg SJu 

qtattttggo g-aagtcttct gtaaaattca rj^caagcaoc gacattargc tgagctcagc GOO 

Ctccattttr: cjttt gtr-^t It'-v ttl. i^t i:i>X:r,.<ic ta« u^l "i L t gtcVH:<X"\ic-t 660 

■^LU^ta l u^a grcaagatga atatcttggt t3tt.t5t.gtg atgatett^a tt aqttq.jao; ?20 

tytcii-ctgrt gtttttgeat ttggaatgat ctttctggag ct'aaacttca aaggcgctga 7Fji> 
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□ g-j-gatst at 


t ac-aa^catg 


t tcact gcag 


eqqaggtt qc 


t cztqt ctt ct 


£ tdy'i^iftCi-H tit 


& 4 0 


rstct 4ggg ta 


ctg-accttt a 


tgaottcttt 


ttatatacct 


qgati^tatta 


tgttb l.glifjft 


9 0 0 










ttasttagtg 


atq^caati"s 


9t>0 




At t^q^L: tyq. 




tivjaatttca 


caaaiqoaaag 


aaiqgasd y c 




tgtg-sacisca 


ttoggosetg 


tgatqjGi^gt 


1; KlLi:r-t*Ata 


tgctggtgcc 


stttctttat 




c t gt jcagtc 


at C^gacCDtt 


ttcttcacta 


c att-a tt C-^ti 




^tcist gt-a 


1136 



<21C> 38 
<211> 256 
^212> FRT' 

Met Met Pr^ l^he Cys Hi a Abh Tier lie A.sji II* Ser ^y^ Val Ly& Asn 
j f, 10 15 

Asji Trp 3er Ai?n As-p Val Arg Ala Ser Leu Tyr Ser Le^ Met Val Lo^ 
2D 2 5 -0 

He- II- L^i Thr Thr Lrju Veil Cly Asn Leu He Val lis Vail Ser He 
:ih AH 45 

cj eI rtls The Lys Gin Leu Hi o Thr Pro Thr As ft Trp l.eu 1 K' Kis Sor 
^0 *S * D 

Met Ala Thr Val Asp Phe Leu Leu Gly Cye Leu val Met Pro Tyr S*i- 



7 0 



S l.i 



Mc>K Vfll Ai-q S*r Al n Glu Bic Cyy Trp Tyr Fhe Gly Glu Val Phe Cya 

' ' " tf5 50 

L.^ ii^ nifi Thr Tt.r Asp 11* Mel "L^n 5£r Scr Ala Ser II* Pbe 

100 HO 

His L^u tier rhe 3l& iier lie Asp Arg Tyr Tyr Ala Val Cy* A-'-P Fro 
115 120 12S 

l P y Arc Tyr Lys Ala Lys Met Asn I If Leu Val He Cys val Met He 
130 13'' 140 

E'hr Ilr r^r- Ti- P f3*r Vn L Pr o ALn E'hc AIn Phc Gly Wet He Piie 

145 ' ISO 155 160 

Iku GIU L^i Ry.Ti Phtt Ly* Giy Aid Glu Gig He- Tyr Tyr T.ys Hi-*". V«l 

iss no 1 7 ^ 

C«s Arq 151 v Gly Cys Sex Val Phe the Ser Lys Lie Ser Lily Val 
" 180 1*5 190 

l~u Thr Phe M-t Thr -Scr P'no Tvr lie Pro Gly Set lie Wr?t Leu Cys 
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VyJ T v t Tyr Ajrq lit? Tyr LeiJ I ],t? Ala Ly* GLu Gin Al.i Arij He 

211) " 215 fc£0 

tjtr Asp Ala Asn ULn Lvs L-eu Glri He rjly l*d Glu Met Lya Aan Gly 
22^ 230 235. 2dL> 

He Sea Gin Ser Lvs Glu Axg Lys Alo Vai Lys Thr Leu Gly I Lb Val 

Wet Sly Val Phc L-?u. lift Cys TVf- cys Pre- PTic PJic He Cys Thr Val 
2*0 £65 "70 

Mhth A%p pre Pht? T-eu Tyi J J.*? I i.r Pjto Pro T>ir I-ni? ftsn A?p Aln 

-75 £SQ aU5 

Arc^ Gly At 9 Ala Aen Sejr All =- 

<^ll:-- 102 J 
<2J2^ cm 

rj.jSii I < j ;-j hqt: LiCf.tt.J-.g.lQa CBatSt.rtdtt ditdttttcL Citqf.Q;* m j .^.-3 i : L. i H 1 1 C.^ 

a j t u^tiL-t c-c gtgcteccct qtacaqtt ts atggtgctcs taatt^tgac cac^c t<_gtr. 

qgca^tctga tagttattgt ttctatatca cacttcsaac a^cttcatac ccG3acs^t laO 

t qget cat t c pttr^utqun t3r;tqLiji^e<j tttot I 'j- tryi cigtrjtut <rnt catqccttac 2 AO 

j l. ij i-* l-<!j .-i QrftL-:r.^ctqa qcactgtttig t art tt.ggag aagtcr.t.Lttq l.a A.^t I Oisr.: 300 

scraagcaccg a cat tat get qdgctca-jcc tccsttttcc attrtgt^ttt catcttcatt 

qaccqctact atgctgtqtg tgatccactg agatat.saag ccaagaitgas ta t ct t qgtt JiO 

stttgtritga t gatc-tt&Eit tngt t ggagt gtccctgctg tttttgoatt tgg.aat^atc d£0 

t ttotgn,:<qc t aaact t cJ^nc?r->.- 1 r?^.? g^yato.tatt p-cynacotrt teac-tgeaqa 5 40 

r.vjagcjtt got olq lt::L. let. t teQCAaAar.A tgi-.^cjaqiao ti;^L^i:.i:.t?l qdutt c-t ttt 600 

l.rtlii t.5tf:t-.^ giLtt^Udt gLtatgtgt.r: ta L L u cag a.=s tiLfitcjt t .i t- C^L.^fi^tf * r?n 

caqgeaagat taattagtga tgocaatcag aaqctcc-saa ttggattg^a aatgaaaaat 72 u 

gqaatttcac aaagcaaaqa -aaggaoagct gtgaacvacat tggg^attrt gatgggagtt 7fc!0 

ttcctaatat gct^gtciccc tttctttnty trjtacT^tca trivia ceo ttt tcttcnctac £40 

€tt t at t r.vjj rj r,i tvs t; L I.. I .■'jj* * L q i-i I: gt .i t-tq & L : L L C|C ■ t. t t q L a cjt t. g.d A 0 I c: I .« f:.* I, L;. I.. ^ [i' ) 

.;i jtO'.ja^t^Q tt.tatgcatt tttctatcct tggtttagaa aagcact^aa gatqatgetoi 960 

f.rtgqtaaa3 ttttcceaaa agattcatcc aggt^taast tatttttgga attqa<?ttca 1020 
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<210* SO 
<212> PP.T 

<dOO> 9D 

Met Met vrc Phe Cys His Asn Ha He Aen 31s Efer Cya Val Lys Asn 
1 & 10 15 

A^n Trp &^ Aa-n Asp Val Arg Ala &cr Leu Tyr S^r L»u Met Val Leu 
20 J0 

T]e TIE Lr;« TTir Thr Lev v*l Gly Aen Leu He V«l He val $«r Tie 

35 

Set uia Pb* L.ys Gin Leu His Thr rro Thr Asn Txp L*Q Jit Hi* Sex 
DO 55 60 

MPt Ala Tbx Val Asp Phe Leo Leu Gly Cys Leu Val Met Pro Tyx Sex 

6S "7 0 gG 

Mei- val Arg Ser Ala Clu His Cys Trp Tyr Phe Cly Glu Val Phe Cys 
g.s 30 ^ 

Lys He Hi--! Tin: Ser Thr Asp lift M&r. Lt^ Eft/- S^r ftL& He Ph>: ' 

J.QO ICS 110 

Hie Leu ser Fhe He £ei He Asp Arg 'J'yr Tyx Ala val Cyu Asp rro 
115 120 125 

L~u ftm Tyr Lys Ala Lys Met Aiii H* l.^u V«l Tie Cy6 V*i Met He 
L30 

P] ]t - T] ,. r^.c Tip Sftr V*L Pro Ala Vel Fhe Ala The Gly Met lie Ph* 

Leu GLu Leu Aan L>be Lye Gly Ala Glu Glu He Tyx Tyr Lys His Val 
1GS 115 

tJla r V s Arg Gly Cly Cys Scr V*l Phc Pho Sur Lys T L<r- Sex Gly Val 
LB0 " | fl c> 1HU 

L*n Th-- Pt--:: MrtL Thr S*r Ph* Tyr He Pro Gly lie Met Lea Cy-^ 

v . :j l Tv . r T-.-x Arc: He Tyx Lea He Ala Lye Clu Gin Ala Arg Leu He 

•3** Mp Ala Asn Gin Lya Leu Glr. He Gly Leu GL-> Met Lys A^n Gly 
230 235 

Hp aer nin 5er Lys GLu. Arq Ly- Ala V*L L V « Thr: t.ku G L y v«l 
245 ^'j ^ b 

Met Glv Van Ph<? Leu Lit Cye Txp Cye Fxo Phe l/he He Cys Thr Val 
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M«t Asp Pjto Phe Lrru VI Is Tyr II* lit: 



Thr LeLJ Asp VsL 



J i.v JJ.tt Trp Pht Gly Tyr Leu 'Aet\ 
"2. y L I 2 5 & 



'J' hr fhe Aan i/ro M&t VaL Tyr 



ALa Fti* Pher 'fyx Fid Trp Fhe Arg Lys Ala Leu: 
BOS 310 315 



GLy Lva ILe Phc Gin Lys 
32* 

L*^> .^r 



'<t?r Sft r Ait; Cy 



Lys t'tet M&t Leu Pb*3 

32 0 



T.vS 1.1 Phr Lt -'- J l "^- IJ 



<210> LU 
<211> 

<21?> DMA 
<213> ti.&apiena 

< d 00> 9 1 

ctgtaaagta gattgtatcia ngncttr.itt] I- rjatoca OttoaiiiJ-Loe tttjyi^t^f]""' 

ataattactc aei»#gnti.iwt- H cift^t c-gvy Cisg^tjag-gga r.ggt.gadttci cctggAgat:c- 

c^it^qonccq t: 1. tic cat act egg teat tcaoaccitc attgattcac caggcaccac 

t .ci^t.-y tLta qc^qqactct gqgqacccca «5:atgcacact accatg^aag ctgacctggc: 

tgecact qg c cjcagncc-rc gencagayet t'j^t'.i^t^nq .^iil.crit.ftCr uir-i-^nntjbrji 

ctgqgacsc-g ntcttcc-tn^ t-q-01; Llil: Lt«g gg .:tgt':-:":ftc<Csi-a a t iiiggt-.t-qaT. 

ggcqtggctg nccqrjr.'tC-C': ^/gccccjgca tgg a uet.ggir: scgcgrctgc; c^ctq'CtccT 

gctc^gc-ctg 9O0-:: <..-t:l:i%f -J a<: tt c 1 1 gtt c.-tggcag ca geggectt^c agat rctaga 

l.r-oggcdt gggggacact ggccgctggg gsca^ctqcc tgccgcttct actacttcct 

atggggcgtg tcctactcct corrgoc t ot t c^tijctg^^r; ■ju^fjtcp.^r.i- ttijai^^ot-n 

CCtgCtdCJcg ct-gtg^CC^O iLl-jt tgg gCA-T: f: gc OC&qLQCgil'C t^.:<rr:>;r.L:tC: 

■igt':;tqc-^<:r. ggr.^t ctgyq tgctggccac actct t cage gtgccctggc t^gtcttccc 

C-qaq^rtq.-:? gt ct ggt ggt acgscctggt catctgcctg gacttctggg acagegagga 

qi:;r.qtf q-ctg aggatgetgg aggtcctggg gggcttectg cc-tttcctcc tcictactcgt 

ctQCCdCqtg ctcaeccagq acacarwt n t^rjLMCctnc c-nntr-gricr^iy^ Lirvjai^^cc-nit. - ; 

dgcctgccgn gci^t t cgc-^r, qtptqp'Ci^ g^LXL':«l:i-L:Lg Lcaqcctatg tog t c^tgag 

get geect :ic caqut q Lrc:<u: aqetcict eta cctgqccttc ctgtqjggacg tcta.p:tctqg 

ctor :ct'juto tgypaijyccc t ggt eta etc cgactacctg at cot at too acagct gect 
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la J 0 

150 0 



cagcr.ccttc ctcr.^^.c^ t^co^tgc c^ncctccgg accct^ctgc gctccgtgct llll) 
Ctcgtcctt u qi::pgcaqr:to tc-tgcgagga ^cijfjecgggc agcttt*cgc ccactgagcc 1200 
dcagacLCflti ctagattctg agggtccsac t:tgec^ae c^Ug <W-Cc-agtc 12 60 
acagatogat cctgtc,gccc agcctcaggt gaaccccaca ctccagccac gate y o^L. r-t- 
cacagctcag ccacacjctga accctacggc ccagccacag tcggatccca cagcrcagcc 
acrt^utcvsac ^cfttn^c D.q <^ tjt 0 Xjd t 1 ct gt g gcccagccac aggcarscac 
taA^ti:ta<j acr:cctocac tt get gees g tt^tig Ly.icf: a^v scat uhg ^tr^acicttc 
cccidCLCca tcct^catc ctaccccsgg gg sect t gag gacceagcca cai. cUicKgr; 156 0 
-stctgaagga gaasgcccca ^gcjccco gecagaggeg gccccqggcg cac,gccccac l£2U 
gt^ogcitcc agwacaogc aggcccncca riagcacjtgaa agaccccagg gcagacagag 
q n p. a g c ^ g t c -n <j a 

<215.> h Sapiens 
<d00> -?2 

Leu Ala Tip Arc Oys Thr Ala Pro 5*r Iau Pxc Tyx 3er Val lie Wis 
L ' 5 l-O _I . 5 

Thr TJ ^ 1LO F ' r '' Thr TK£ Pl ° Cyi Fr "' A] ,i1 01 y Lc " u - Trp 

Mi 

G1-- Fxo Gin M*t Asp Thx Thr H&t Giu Ala Asp L*u Gly Ala TJ"i r rjly 

Kia Ai-g Pro Arg Thr Clu L*u Asp Asp Clu Asp Sex Tyx Pro Gin Gly 

5 0 55 SO 

Hi *? Tit. J"^P Tht Val Phtr L-eLS Val Ala LtU Leu LfcU l.*U Gly T.*.'i.J Pir^i 

£5"' " -70 ^ ^ 

Ala Aer, Gly Lea Met Ala Txp Leu Ala Gly Sei" Gin Ala Arg Hie Gly 

B5 95 

Ala Glv Thx Axg Leu Ala Leu Leu Leu Leu Ser Leu Ala Leu 3«r ABp 
100 105- HO 

Phe Le«n Pii" f-^ 11 ? ' 1 ; * Ai " Pl '"* G1 " 71<; ^ ,llJ T 

US 120 12* 

nix? Gi« Hi* Txp Pko Leu Glv Thr Ala Ala cys Arg The Tyx Tyr Phft 
' L3S 110 

L*u Tiip Glv Val Sex Tyx Ser Ser Gly Leu Pht Leu Leu Ala Ala Leu 
US 150 155 160 
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£:er Leu Asp Arc] Cys Leu Leu Ala Leu Cys Pre Kia Trp Tyr Pro Gly 

165 170 175 

Kls Arg Pro Vy.1 Tin Lou Plq Lou Trp v^.l Cys G] v V^.l Trp V*l 

1 F l 0 l$s 1 vO 

L^u Ma Thr Xt*'J PL* St; r Val Pre Trp Leu Vfil Pht Pro Glu Ala Ji.L ^ 

1. 9s 20D 2fl* 

Trp Trp Tyr Asp L^u Val lie Cys Aep rhe Txp Aep Ser Glu 

210 215 220 

Glu Leu i"E-r Leu Arg Met Leu Glu Val L&u Gly Gly Phe L*k> Pro Phe 

225 230 ^5 24 0 

L-jii Lou T^'.j V*J. Cys His Val Leu rhr Gin Ala Thr ALa Cy*s A^g 

2 J 5 250 2 £5 

Thr eye His Arq Glri Gin Gin Pro Ala Ala Cya Arq Gly Phe Ala Arq 

2 GO 2*5 2 70 

Val Ala Arc; Thr He Leu S*r Als Tyr V*l '-'^2 Lou Ar^j Lau Pro Tyr 

2 75 2 50 2B5 

GLn Leu Ala Glu Uuj L*u. T v r hrv 7il^ Phe l>;u Trp Va;j Tyr "tr 

Gly Tyr I^tiJ T-&u Trp Glu Ala Lej Val Tyr Ljer Asp Tyr L-eii rle Leu 

' 3L0 3 ID 310 

Ltd Aetl 2er Cys Lea Sax Pro Phe Leu Cys Leu Met Ala Ser Ala Asp 

32 5 3 3 0 J 3.-5 

Leu Arg Thr Leu Leu Arrr S<?r Vn.l Leu S*r Scr PL* Ala ALn Ala Leu 

3 40 " l-i50 

Cys Glu Glu Arc.; Pro Gly Ser Pho Thr Pro Thr Glu Pro Gin Thr- G.lu 

T.^ij Js.^p Grit C;lu Gly Pro Thr Leu Pro Glu Pic Met Ala Glu Ala Gin 

;-?7C' 37 5 3BD 

Ser Gin het Asp Pro Val Ala Gin Pro Gin Val Asn Pro Thr Leu Gin 

2-6 5 3" 3D -'5 3 5 400 

p r r. A\X*r G-:.'-r Asp Pro Th ^ Glu Pro Gin L-^m Aaji Pro Thr Al^ Gin 

4 05 d 10 dl. r f 

FYt. Gin Jer Aep fro Thr Ala Gin Pro Gin L<eu Asn Leu Het Ala Gin 

■120 425 430 

Pre Gin Ser Asp Ser Val Al-> Gin Pro Gin AJ.:.j Asp Thr V^l Gin 

Thr Prr.i ft I o Pro 7*1 -a Ma S^r ^er- Val Pro iler Pre- eye Asp Giu Ala 

4 SO d B5 d 60 

Sor Pro T'hr pro Ger St- r h'ls Pr-i Thr Pro Gly Ala L-eu Glu Asp Pro 

Page 6 7 



BNSLj! x:i[j -.WO 01 ^4/ ?A2T 



WO lj 1/364*73 



pciYusaitmsaj 



4 65 W AB ° 

Ala Thr Pro Pr<i Ma Gin G\y Glu Scr Pro Ser Ser Thr fro Pro 

4Gs Jyu 495 

filn 7X1 ,3 Pro Ely Ma Giy r?ro Thx 

son so & 

<ilO> &3 
<2il> iU13 
<Z12> DHfl 
<2 l ~y : ' H . Sap i ari a 

«t. 4 c.,ic^t8 r:r;^(jg^yc tgacctgggt catling ex aiMiogr.ni;.:^ C^^a^RPCt-t CU 

ust^tc-ayc, actcctaccc ccaaggtggc tgggacacg^ tcttcctggt c-gccctoct.c. l'-'O 

ctccttgggc tgccagccaa tgggttgatg gcgtggcic'-r cccgctccea ggcccggcat 

cgsnctngca cgcgtotggc got gotcctg ctcaocctgn ccotctctga cttcttgttc 

ctgcicac&jc- ^gncQtt-Ton ^at.ccttfqu-j s.tccngcatc' gggracnctg scccictgggg 

n.Cinctncr.t <j tsrujOl t •: ST: It Cvt t pg. ^O'J t H+ cntfCttsctC cggrctcttc 3W> 

i<l;gi:Lgt.o g .r:iTf:t..2dQ,cct rqaccLCtuc ctgctc-gryc tgtc^r^c* t. \-<&t f " :cct 4^0 

ggqciccgcc cagt ccgcct gcccctctcjg gtctgcqccg gtgtctgggt (.ctgaicaca 

ctrttcagcg tgccctcigct ggtcttcccc gaggctgecg- tctggtggta cgacct i^gtc 

atctgcctg-j acttctgnga can^lti'^zig ctgt rjcctgn ggntgctgga cgtcct rr. ? gg 

f?l?( _^ tor , tlir: c tttc^tr.;L tv-'trj<;tt:.:^o t iK^^gtgQ t'jijcccagcio c^oaccctqt 660 

.•^•HOCtiO..:-!; ^c.ii^&aoa y.:aii(:trqf:^ i/i^UqciC-^ij n-; I; "t r;n^r ntj tnNjrcnoqg "720 

accattctqn csgcctatgt ggtcct^sgg crtgccctacrc ag^tgc'CCCiA y^LtjiihriU^O 

ctggccttcc tgtigggacgt ctactctggc nacctgctct gggaggcrct ggt ct set cn- 

gactacfc'.*J t-::i^t«-t:r.vi r.^/ 3 r.U^i.-±r. ji/oucwttw tctgcctcat ggc/iagtccc SCO 

,^.:n;ccLiq« c tig.:: r goj ctccg tc-i<=t.r; r.iieLnc:l!t:^ i;0 ] uiiu^t; ..:tg r^-a -jnag 9GD 

^ggcrjgggca gcttcsc^cc cactga^cca c?agacccagcr tag at Let g* *jsgt.r:o*3t?t 

ctgecagags cg-atggrag-a ggcccagtca cagatggatc ctgtggccca gcctcsggt.g 

aaccccacac t^a^cacq atc^gatc^c -acagct cage- cscagctgaa ccctacggcc 

-n.li.:r.^i:-;iqt i^qst,:;^^: rtqUnCiiQir^i r ;i ye L Lf Aa f: C *. u.13.:: ni.: A ij r rj.r 1200 

g.%,:r L ; !t ;gtgg cc^aQccaca ggcagacsct aacgtccaga cccrrtgcacc tcjctgccagt. lifiQ 
tctgtgccca gtccctgtga tqaagcttcc ccaaccccat cctcgcatcc tsccccaggg 
gcccttgsgg acccagccac acctcctgcc tctgaaggn-j *aa^ccccag cagcscccc? 
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L--::hO,viijyj.i}C?. r: cC'-goiq-c Q C a occ & I'll 3 

<210> 9 4 
<211> i 19 
'<212> PF.T 
<213> H . SspierjB 

Mr: I. A_<;p Thr Thr Met Gi-v /lis &-?p Leu Gly Ala Thr Cly His Ar<j rro 

1 s "JO a 5 

Arq Thr Glu Leu Aep Asp Glu A£p 5er Tyr Pn;. pin R.l.y Gly Trp Asp 
2 0 2D :}o 

Thr Val Phe Leu Val Ala Leu Lau Iau Leu Gly Leu Pro Als Asn fily 

I,mi Ht^t AIli Tip Lout Ala Gly £ r ±r Gin Ala Arq tti.3 Gly Ala Gly Thr 

4^ £0 

Arg Leu Ala Leu Leu: Leo Leu Ser Leu Al* Leu ^ er Aftp Phe l-ru Pho 
6L 70 75 Bll 

Leu Ala Ala Ala Ala Phe Gin He Leu GLu I la Arq his Gly Gly Hi ft 

S5 90 $5 

Txp Pro Leu GJ y Thr Ala Cy£ Arq Phc Tyr Tyr Phe Leu Trp Gly 

1 1.] ij 1 0 110 

Val c-ejr Tyr Si^r r>^t fily T>s:m.« Phe L^u Leu Ala Ala L*u Ser Leu Aep 

Arg Cys Leu Leu Ala Leu Cy& Ltd Kis Trp Tyr Pre Gly Hie Aiq Pro 
130 13*- 14 0 

Val Arq leu Pro Leu Trp Val Cys Ala Cly Val Trp Val L*u Ala Thr 
14 5 150 15* ISO 

Leu Phe Ser Val Pro- Trp Leu Val Phe Pro Glu A). 5 A In Val Trp 7'rp 
169 170 ].75 

Tyr Asp Leu Val 11a Cye Leu AEp ftie Trp Aap 3er Glu Glu L*n> Sol- 
ISO 13 5 1 4 IJ 

Leu Arc; Met Leu Clu Val Leu Cly Gly Phe Leu Pro Phe Leu Lau Leu 

19* 2D0 205 

Leu val Cys Hie val Leu Thr Gin Ala Thr Ala Cyft ftivj Tlir Cys H-i.-? 
210 215 22G 

Arg Gin Gin Gin Pre Ala Ala Cys Arq Gly Phe Ala Ara Val Ala Arq 

2 25 2 3D 2 3* 2 40 

Thr lie Len Stx Ala Tyi V^l Vnl X^u ft-ri] ItM Pro Tyr Cln Leu Ala 

Page ej'4 



BNS[K.X.:i[. .:WO C' IM^MA,-! 



WO U 1.^6473 



Gli> Lf^ T.*y Tyx Leu Ala Fhe Lftu Trp Asp Val Tyr Scr Gly Tyr Leu 

2£D 2t5 i7C1 

Leu Tj-p Glu Ala Leu : r al Tyr £er Asp ryr Leu I.l.r l-eo Leu Scr 
27S 2S0 

Cya l^u Ser Pro Phe Leu Cys leu Met Ala Str Ala Asp Leu Arq Thr 



290 



9 5 juu 



Leu L*u Arc Ser Vol Leu Ser Ser Phe Ala ALa Ala Leu Cyis Glu Glu 
305 310 315 a^O 

p rq prv. fij v PTifi Thr Pi o Tlu- Oiu Pro GJ.n Thr Gin Leu Asp Ser 

' " *3 0 335 

Glu GJly Ti-D Thr Leu Pro Glu Pro Met Ala Glu Ala Gin Ser <S 1 r, Met 

Asp Pre Val Ala Gin Pro Gin Val Asn Pro Thx Leu Gin Pro Arc; 

555 3 60 

As.p Pt-t3 Thr JUa Sin p i O 1. n L<Su A--. 1 1 Pry Thr Ala Cln Pro Gin £er 

3?S 330 

A*-r> Pm Thr Ala Gin no Gin Leu flen Leu Met Ala Girt P/:o Glu S<*r 
36 5 " 3 DC' i95 ^ 00 

Asp ^cr Val Ala Gin Fro Gin Ala Asp Thr Asn Val Gin Thr Pro Ala 
4 05 <10 

Pro /'la Ala 



<21C» 


95 


<21 1> 








<213- 


Artificial Sequence 


-:22-0- 




-:221> 


misc feature 


'-2 2 s.*" 1 ' 





ttcaajBacr.r. atggaatcst clttctcett tg^Uttho <:ttqctgtc 



-210/- 


?b 


-Vll> 






PtlA 




Art 1 f i LU 1 cr?qii rj:]]c^ 


<2 20> 




<22l> 


mi£c feature 


<221V> 


Novel Sequence 


4 0 0 > 


96 



ttcactcnaq ttagccatM -sactctgnQ-c trj^agntagt gacgatgtg 



40 



WO LI 1/36473 



pcr/ustmmssi 



<2lO> 37 

<2gi.> 22 

<£12> DMA 

<J 1 Art i i" i ci« 1 S e-qu^n r:r- 



<A0O> 


57 


gctc^accca ctcatctatg cc 


<2J.I) * 








< 2 12 


DNtt 


<2 1 2: > 


Artificial tegutrjCt 


<22D-> 




<22L-> 


iniac featuie 


<223> 


Hovel S*d nijence 



^aacttctct gccctta^cg t l j 





*9 


=:2i 


20 




DHA 




Artificial Sequence 


o. 






til. 'Lso fij^tuf ■ 


::223> 


Hs":-V«.l. S^t"] uttcK'+t 


-4D0> 


3 9 



aaagcagc^c ccccaatacc 2 0 




Arti ficial 



100 

••1 




iTiiBcf ea tore 
Hovel Sequence 



<i|un> 10l> 



<:2LD> 



101 



71 



BNSLOCID -.WO OUiaIA-MZTi 



>V0 11 1/36473 



<j.H> 2'r 

<J12> DMA 

<21'<> Kl riiiicial i^^u-enc*! 
<220> 

<221:- misc_f * ature 

< 2 2 3 > W o v e 1 3 e q J b nee 



<4DC>:- 101 



<:.112- DUA 

c j 1 2 Artificial tequttij c e 



*22 0;- 

<22L-- mi5G_featuife 
-.2/3"* Novel Secuiwiice 





1 1] 2 


r.tci:ti 


r.cagtctgca geeggt tctg 


jl0> 


10 J 


<21 1 - 




-:212- ■■ 


DNA 


<2iv 


Artrif ica*l Soqu^ncr. 




inijsc f eat □ it; 


if 2 3 :- 


Nove-1 tequerjc? 



-4 00:- 10 J 

^ t:h t cot eg c t at ct g t g cact ct n eg 



^.210> 


104 


^11> 




<."}*'..> 




C2l j > 


Arr.itfi-cia J. r^ai,'* 


< 22 C- > 




<:221 > 


misc feature 


--:22 jv* 


rfovel Sequence 


t J u 11 > 




c.c-Las^gtgc ac&tjatag^g '= 


■:21Q> 


105 




19 


v"212> 


DMA 



"213> Artificial Ee<3u*mc€ 



3D 



PorrF: 7 2 



WO I.U/36473 



pcx/usaumsai 




Novel 5 £41.1 rjn-C 



ddece c-a tci tctacaccjc 




■- jLD> 
--: C 1 3 > 



IOC 
IS 



Artificial £■ 



< j 2 1 > mi *s L:_t ^fltJJrr 
•:.'J2i-> Wfive-1 Sequence 



1 1 o :> 

< ^ 1 2> DWJi 

<>15> Artificial Sequence 
C20> 

-"12J> misc_fe.atur& 
-: 2 2 1 Wovwi S equ e n cb 



•:: J LMj> 1. fj 7 

■li-id t aaojct t L'catgtdidd cgggtc-gtgc tqc 





ID 3 




3 3 




DMA 




Aj: tif ici^i £-?r;u^ric& 


2 C<> 








<2 2:<> 


Hov^ 1 0 fti'ju tj/i 



<4 0fj> LOS 

i^c-at t ctaga tcagtgccac tcaacaatgt ggg 



::2i0> 109 
<*:•><: »> 

<J2 2'1> misc ie-ature 



Pag* "7 3 



WO Li 1/36473 



pcx/usa«/3is»J 



<22$> Novel Eequenirft 



<>i 0 0 > 

gasgcccagc ectgtttscc 



<zm> no 

<212> DMA 

< i' 1 3 > Tir till c i ft 1 Sft^u&n re 

<:•: d> 

v^'iL^ mis c_ feature 

>3ovel Sequence 



4 00 * 110 
tgn.cotacct gtccgcagcc 



<2ll> DNA 

<21j:> Artificial sequence 
:220-- 

<223> Novel Se^uencE 



gatcaagett atg^cagqtci acttcccaag tatgc 





112 


< 2 1 1 > 


3 4 


<21L:> 


DMA 


<?i 


Art i r V3l SoiivCriC 












Novel Eequence 


<4D0> 


112 



gatcctcgag get ^^c^cqca 'iaaiacicas. ttcc 



^jj0> 113 

^ L "i 1> 1 " 

--■ I 1 ~$> A 1 1 1 f:ia 1 St: qu$nce 
<l:20> 

<2 21> misc_featijre 

<223> Novel St queries 



Page 7 4 



WO 111/3*473 



pcx/usuumsai 



■ZAOO> 113 

esq cco^ii ac atccsagtc 



*- Z 1 1 13 

<Zl2> DUA 

<I'L3> Art i rici&l S^uArt^S 
<-j 2 D > 

< 2 2 1 - misc_ft:dtyie 

<r23. ; Hovel SHtjuence 



-:^00.: LI 3 

i.-C=c :0«ct t ei est cosset c 1 " 



■:: 2 1 Q> 1 1 Ei 

<111> 3d 

vl 12 > UNA 

:I13> Artificial ilequenoe 

■ ; :£21> jni-sc_f e?.turc 

-2 2 3> NDVC-1 cFqUMMX* 



£.2.t Cij ijt:r.<: yr-:^q qjaijCda t yaoa.^t:. fray g i-j <? 



^2 10^ lib 

.;-ii> 

?12"- PNA 

J.::*:-- Arti fioia L Scqucnce 



11 i' 

<213* Artificial Sequence 
•■:4 00> 117 

acaqccccaa agccaaacac 2 0 



^1D; Lie 

: : L 1 - 22. 

•-. 7 1. 3 - DWr, 

^: 1. 15 > Ariiil.icial tle^uence 




mi sc_fe dtui'H 
Novel SEqutnce 



■•^0n:: lit- 

ton ;i * t - 1 c 1 1 at at a t gt t oq iiaaa ,r„.=<t t cat q g 



WO 01/364*73 



Krr/uS(jo/3i58J 



<40Q"> llfi 

ccgcaqgagc aatgaeaatc 





115 


<c n> 


15 




DNA 


<?1 3*> 


Artificial Seqn* rice 




11* 


ct^aaagttg tcgctgscc 






=:211.= 


21 




DNA 




Artificial ficquyHW 






<2 2 1 


rvdfic: feature 


<2 23> 


Novel Sequence 



13 



<1G0> 120 v 
cq=itta tecs cacttt^icr: c 

-.210 121 
•:.>il> 2 5 
<Ui> DW7i 

<215> Rit.lticiisl oe^uerjCE 
<4 0C-> 121 

gc-ataccatci aat^gcciic tagac 



<210> 


12*2 


=vll> 


jn 


-.>i:> 






7 s , r 1 i r].i\1* 1 N*sji.j 






<;*2i > 


jtil £ e feature 


-:223> 


Novel Sequence 




1 27- 


L>t:a T. cf cqri G t.rift h q Tit g 




123 


< 2 i 1 > 


2D 


<212> 





■:213> Artificial Sequence 
-221 > To! so feature 

:223> Hovel S^hk=14:k 



WO U 1/364*73 



per/usauyjiSBi 



Ctytd-C-t.:!, yti^CtL'ttL-C 


21] 


-211 > 
-:212> 
•21"» 


124 
2 2 

Artificial Sequence 




•: : 2 1 : 

• 22 j > 


Ni?ivei Sequence 






124 

c.jatot t c^t t qsint t tc- 


2 2 


v:>in-> 

2 12*> 

2 1 : > 


? :;s 

ftrtlf ici-al Rftqy&nce 




-■ 2 20 > 
2 21.- 


miac feature 
Novel Sequence 




■=: 4 00-= 


125 

J il r+O H CSC 1 1 C il t CS CL' c 


2 3 


■:2lu> 
■:21 1> 

•-:212> 


1 B 
DMA 

Jljftif iciel Sequence 




;■ "7 fY: 


misc -.it lire 




■Ol 00 :> 


12b 

ragca tagtaqcq 




•2 1 0:= 
-211- 
"2 1 2:= 


127 

20 

DHA 

ft rt i f i u i a L i ;ju<; nee 




0 2l.r- 
<v2:>> 


jul e iz ftiture 





: -1 00> 1 2 7 



BNSLKX:iU -WC; (>Uu4riAPTi - 



WO 0 1/36473 



:ag3gctt-ga tqatqaqgdC 



<210> 123 

<S11> 20 

< 2 1 2 - DWA 

<213> Artificial ^qu^ce 



<211> 




< <! ^ J > 


Hove I Sequence 


<4CO> 


123 


cccataggaa gtagta^aa^ 


<210> 


1 ^ 


<21 1> 


y 


< 2 1 2 > 


FT*T 


<213> 


Synthetic eub&trate 


<220~-^ 






raiEO feature 






<4 0Q> 


129 



Rl.* pro ftt-cj Thr Pre Gly Gly Arg Arg 

c210> 110 

<211> 52 

<212> DMA 

^213 > Art l f id a 1 S enjii v-nc-v 

<2.2rO> 

-:2: 1-- nM=.o_t>^Liir^ 

^i'i -] -' H nv-H L .^ttqaen Lit: 



. .■100- 130 ^ ^ ^ 

qcgtaatacg actc^ctata ggga<rsnccic qtqtctquta ^ctot^ttt 



-:Z10> 


131 


<£1 1 > 


aft 




DMA. 




Artificial Sequence 


*-:22 0> 




-22i> 


E1113G f CSt '.Tri 


-.2'2 3> 


>3qvc1 S0<>f^nt;:-e! 



tg c C AC a L-r.sj stgc as ct cc 



WO U 1/36 4 "7 3 



PCIVLJSUO/JISSJ 



'211-- AS 

*2\:.± on. a 

<21 2j- ?ii t 1. Tie i £ 1 Eequ t: nC£ 
<JDO:> 

gcgt jatacq actcaatate gggagacctg ccacactgat qca^ctcc 



-210- 13 3 

--211- 24 

•-: 2 1 " ? DFA 

-21 Artificial Se^u^c* 



<22i> crti £c_fest lute 
*:'2 2 3^* Novel S& qui a nee 



<2 1 1 > S l~> 

<21_<:> eutificial Sequence 
v^Dic 174 

gcgt Aatacg act enctflty qir^^^^CT-cfc j^qr^nr.-i-Cl: nt 1 1 M.cta t nrc 50 



■a^ L7f- 
<y\i:> dvja 

<2l > Ai-t ificia.1 Sequence 




mi sc_f eat ure 
Hovel SAnuercD 



ij L;:rti:,;"id,^.-=ii::.^ L~:;=Li; L t CCct t 3, agee 





i s C-_ t '-!=*■ a 1 1 j re 
JJDvt; i Sequence 



<JD0> 1_«6 

iJiCOltaa-tacq actcactat-j rngnf3^or;[?-:;- attCCrataag 



WO 11 1/36473 



pCr/USUOVJlSSI 



<210> 137 

<21l> 23 

<2 12> tfHA 

< 2 1 j. : Artificial Sequence 

<2Zy^ Novel Sequence 



<4O0> 1 

qctacqiitic tctttactat ccc 



<210> 13B 

■c212> DMA 

<Z13> Art if i t,:i..ii Rftcpientife 

<213> Novel Sequence 



g-=gt*at*cg ast.^ntata fj<gg<KJflf=cr r t atgagcagcra attcatccc 









20 








Artificial Sequence 


,1220:- 




<22t> 


ml sc_teat UXB 


<223> 


Novel Se^utncfe. 




L3* 



i yacce* a 3 p.i sal ft A<g 





HO 




<:211> 


■1* 












Artificial 


Serpen 








-: 221; 


ir.i^-.'_fe..Aiu 






Huvftl Secju* 


ifjCt: 


<4QG> 


HO 





Oiogtaatacg actcactat?i qn^u* 



Page 00 



WO U 1/36473 



j^IYUSUO/JISSJ 



r£ll> 21 
■21?^ DMA 

Art if ic.i.? i 5 -5 qu&hcb 

:l ^ i n v > v ei s & tju f j c e 



- 43 H 0 > 141 

t t=.± ^agcjQ csdttcatcc 



-2in> 142 
•.22 D > 

<-.21/' niiec_f t ere 

2 2 J > Wo ve 1 i> b q 0 ence 



1 r_- 3 L t £i.';*T ai;i-.C:A^r.ar.<s ggg^gacccg attat^ist* C1 t t y 4 9 



.10- 


1-13 




: u 


19 




2L2 > 


DMA 






Art if i 




£20:- 






:-2i- 


miEC f 











•-'.4 00.' 1J3 

ct gaaagtt-g tcgctgacc 19 



^'10 


I 4 A 


-✓Ll> 




- :-L2:- 


DMA 


:: 13 -- 


Artif i-Lii * J- f^^n^no* 


- : >o - 




v '. 1: j > 


ITliftC I'ftflt'dlffc 




Novel Stqjtnce 


■i 4 <J D > 


144 


gcgtaa 


t ac-rt -net c-s-T-t v g.L.i<j ^ gs < 


<: Lf>> 


US 


<;:ll > 


21 


<vl2> 


DWA 



<213> Artificial Segue 



BNS:")(".x-:iLi - WO 0 HU47.nA?"* I 



WO LI 1/36473 



PClYUSCJO/JlttH 



.22 U> 

~223> Wcvei Sequence: 
1jfJ> 14 5 

•'jittstccs csctttqacc c ' ,L± 

iLD> HP 

: 1 :3 > Art J. f j. C i a 1 Eeqn frric & 



120 5 



mitc_featuxe 
Novel Sequenca 



jccjtaatiics) nutuacKiih* (j^ao^ccl gcaaaflt.txi tsr.aagt:rtn^ 

r 1 l)*> 1-0? 

19 

OlJ > Artificial Sequence 

-: J 2 1 > mi c_ f * a t y ft 
: 2* 2- ^ > V.0 vf-1 rift^uence 

-:JI]LI> U? 

riq^rtC^cag^: gcaacctcc 

^I'lO 1-1 B 

Oil> 4% 

--. 112> DHA 

i ' 1 2 > Art i f ICO. rj. L S^(TiaK"f: ^ 

0 2C> 

<"Jil> Jlli M:_f^rt t/urifi 

.■1Q0> 1 5 B 

[jgLCtacitac gactcactat ^grrn^i^icx^. 4^.14.11;;^:^ cs*ai:i:fc<:^ 

■:-ilO> 14$ 
■:.^n> £>? 

Artificial Sequence 

■: ,'2:U> 

rage 32 



WO U 1/36473 



pcr/ufiao/jissj 



novel dequerice 



c-tcjtaaaatt cacacdaqcs dc 





irii3C_teatuxe 
rtovel Sequence 



caccct c 



<: l i > si 

Artificial iieq-uej-jze 

--" 2 2- 1 > (Tt i z> jf c- 3. t ij. r c. 
< ;> > "H r> v <z 1 5 £ n u j i c t 



<'i0-0> LGi 

goatgaattc dC^atgccag t gataagg-aa q 3L 



152 

< r 1 1 > :i 1 

< i' 1 J;i > ft j I., i T i inl lji j*-.ri.u<-: 

<I2 0> 

<2i.l> Mac_ft!dtijre 
< I* 2 2 > N q v e 1 Sequence 



<4 0P> 1-52 

n^t c a -a get t gga-atg-fitgo cottttgcca c 31 



^;iC'> is 3 





mi 3 c_f e.a t u jte 
Novel SiE<-rTij^irjc» 



BNSDOGiD -.w;j ::-JC4 7.M2TI > 



WO LI 1/36473 



PC'lYUiiCMms»l 



<-i 0C«> 153 

ijstcctcg&ci catcattcai aytagcitgcf 



2? 





154 


:?L1- 


A 2 


Z L 2 '-■ 


I'KA 


] 


Artificial SBqii-sncti 






■* r 2 2 1 > 


jTiiec_ttiat ure 


- .2 i - • r* 


Novel SiEsLjLlt: j"j Ctr 


4 00 


■1 r j 


C| D t C C t 


cr<sg ctatgaactc iatti 






- y.i i 




-:21L:.' 




'■2\?-> 


Aitilifial Eequ&ncfi 








misc_f eat e 




Hovel Sequence 




155 








^ll- 








•?1?:^ 


Artificial Sequence 


<L:JIJ> 








•'- 22 z > 





■12 



39 



^iOD.-- 156 

ggatagaaaa at qcat a^ac cattggatta a^r.gtag.nyt l-i-!iai;ttfn;n ■) 



:2JZ\: 


157 




'2\ L- 


35 




'"1 


DNA 




*■ ,\ ] .3> 


AjtL .1 i j.t:l.il £-~<3i 




*: 220 - 






221 ■ 


itiIec future 






Novel Sequence 






157 




gategs 


attc atggacacta 


cca tqqaagc 



>VO U 1/36473 PCTj''LlSaOV315»J 



• :i.i> ?i 

020 > 

<:"21;- mis cofeature 
■-i'23> Novel Sequence 

i^Lt-trLcrj-ig t c:.i i- g l: y Qi^c/ cct<3tgoccg y 31 



OlD > 


169 


<2 1 1 




012 > 




013 > 


Artificial Sequence 








rnasc feature 



13'.- Novel Sequence 



vjlO:- 1*0 

<211:- 20 

<2i:'- EN A 

-:213- Artificial Scqyr?nci? 



220> 

<j!21> tti ta-c_f est Lire 
<22 3> Ncvel Sequence 



tqCC.TjDO.Ctg Cjtq>CCinctDC 





1 hi 






<212> 


DMA 


<212-> 


Artificial Sequence 


*"2 20> 






jti-LSc fpature 


^; t- "7 7. ;r 


Hovel SequencE: 








<210> 




<211> 


2-1 



46 



BNStO^r' -:W; :■ ::1M,4r<A<H 



WO Ll 1/36473 



<212> DNA 

<213> Artificial Seqi 



<221> m±Bc_f eature 
<2 2 3 > Novel Sequ ence 



.-. J 0 0 > 




c.cqtgt. 


itty-. tiiijactc-t at tt.c 


--■210> 


1C3 




50 




DNA 


^21 


Artifioin.1 Seqm>r)0 


<22L)> 




<i:21> 


mis£ feature 


<22 i> 





<4 0<>-.- LbJ 

qcgtaatacg actcactata gggagaccgc acgocactct ttactdtccc 



2 1 n :: 164 
11- /4 

<i 1 -j > Tvr t.i t 1 t: 1« i St quen ere 




ml sc_t eat ure 
Nc-veL Sequence 



^ A CM Ofc $ U I; L ;d i"k- 




1 S5 
DNft 



Artificial Sequence 




Novel sequence 



^400> 1G5 
gcgtaatacq 5 



ictcactata gggagacege acaaaac^ca attccatdag cc 




166 
ON A 



>VO U 1/36473 PCr/USUOttlttfl 



<400> Lt-h 

qcticgccac t ct tt act at ccc 



c 21' '-- 

::213> 



LS7 

Ar 1 1 f ±c 



Sequence 



■-..2 20- 



ni &c_feature 
Nc-ve L S equ ©nee 



•?<:-rjt ti&tvc*} ftOV.OACK&tx iqq y y*c-£t-t flt^iHiiqca attcatccc 



--2LL'> 


lfiB 


s:2Ll> 


2 0 


<2L2> 


DNA 




Artificial Sequence 


; 22 n> 




- 22~±> 


irdsc f e-stuxe 


*223* 


Hovel So^ijflncy 



•■ -J o L" > it e* 

C dCiCCCdCC a .3 y : c; Ef a t C a C| 



<21fl:= 169 
^1L:- -IS 

< L' 1 Al 1 1 f i C J 1 o httUlC-r it.--:: 



:2 2ti> 



misc_f eature 
Novel S eque jj c e 



^■JOC's 169 

q< ":y l.ftcq i ;:,>■:; -tat a qq^.i ^er^c* cac-ccaccaa gaaatc-ag 



<210> 


170 


<211> 


21 


>212> 




<2 1 2- 


Artificial SecrijencG' 


<220- 




<221^ 


jru.se feature 


<22j> 


N o v n L S t*- q u e ri c- c 



bnslx' j -.we; m>,47-jArr > 



WO U 1/36473 



pcx/usuumsai 





1 7 0 


ttatOdQCdii caattcat CL C 


<210> 


171 


--:21i> 






DMA 




Artificial Sequence 


c;:o> 






mi fiC_r mature 




"Novel itqutnee 



21 



■:jr<:it^.?.tacq ^ctcact^ts '^^rT«cc^ a t.ta fc ccaca ctttgaccc 





112 


c ill - 




- j 12 > 




< j!3-> 


Artificial S^quenc* 






-121- 


rTii.sc feature 




WovbI Sequence 




172 





173 


*: 1- 11 > 




-I12> 


DtiA 


-:21j- 


Artificial S^qu«m 






■:^21> 


mi so feuturc 







<.1D0> 17 2: 

gcgtsatacg actcactata gggagaccct gctgaaagtt gtcgctgacc 



-:^:10:= l "7 4 

.-.21 Artificial sequence 



< 22u> 

< 2 2 1 > mi sc_f est ure 
<222> wovtl Sequence 



<400> 174 
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crjattatcca c-actttg^cr: e- 



<?10> 


175 




<21i> 






<Z12> 


DMA 




< z i. > 














tiiIec feature 






Novel Sequence 






175 




q-. c(t-a at *icg act c-nct n t a 


gqgacj-Jiccct 



5Q 



<210> 


17b 


Cll> 


IS- 


-11?"> 


DMA 


:11V- 


Artificial Sequence 






-221^ 









<4Cnj> 17 1 

ai-jaagacsga qcaicctcc 





L77 


:i'll5 


47 






-I* 17- 


Arti f fi ri -vs l S^nvr-n 






<: >■! > 


iiii-;f: LriiLure 




Hove J. Ssqufein^fc 



•-jLm:» 177 

gcg-taatacg actcact^ta gqg.-sqacc-ag aagacagagc aacct c-c 



-:ir» 


170 






oil':* 




<*riii> 


JUctificisl Secjue/ic* 






<-22i> 


miac featury 




Hc-vel Seque nee 


<a u [.:•: ■ 


17fd 



rrt.gt rt.saitt £rac2cra£.g?a, 



x:>D 
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<210> 1"? 5 

-:21L> 31 

-212= DMA 

J.3> Artificial SecjuencB 

*223s HD"el S^qu^rtC^ 



v4 00-- L7? 

cicatcig at cc tott t get gt atttcaccct c 



■^lU-- 


Leo 








DMA 


<2\y> 


Artificial sequence 


-220- 






mi 3 c- feature 







-:ai)Q> 180 

ycatLjSdt.r.sT ^nEatgccaGi tgAt.aagg^ m 



<210> 181 
<211> ?0 
OH A 

■;?L3> Artificial Sequence 



31 



o-2:i> Novel S«^ueft^ 



«ioo> lai 

aciqccccaa agccaaacac 



132 

C2 20> 

<:J2L> misc_f eatujfE 
-■-2 2:'-- rtovel Sequence 



< 2 1 -0 > 133 
s:ilL> 20 

< 2 1 2 > Lu^A 

Page 90 
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Artificial Sequence 




Novel $ <i 1 '<-*?■ 



ctqtrctctct c,tcct ct_ t cc 




184 

22 

DMA 

Artificial ^equencs 



<22l,- n i ,i^c_i mature 
<J23-* Hovel Sequence 



-:4 00-: 1 B-l 

^cnc . gat ct tcattgac-tt tc 



<210> IKS 

■ ah > lies 

<212> DMA 












■aqgc* cgcgc 






ccc-cacigg 1 5 


cctqgcgagc 


cqc tacicgcc 


t,r> 


.1 h yLir^iX-O*] 


i,-r.?i_i uj ljh :Drh 




ct cctrigt ga 


tgc3t act gcie 


cgtggcgcts 


12D 




.r.rtt:!". ygr.C/Ct 


qr:rtt ijt r h'% 




■jt p^gct ccq 


Dac-tcc;agDC 


1B0 


T.O tiyi'^C y LC C 


t I^Ht y Qt_ ll; ■'3 M 




oicjoo a 'H= 


tgetgnogne 


gctggacat £ 


240 


CCCt t CdCqC 


tgctcgqtqt 


gat Qcgcugq 


c cig a c: l±ci c .j t. 


^yiiiO-yOCOi ;<j 






gt-.catt gc^ct 




cttcct^^cq 


t cca ei cgegg 


cQT.t-gagLrLjtL 




360 


-js-jcncagacc; 


cgtggct ggo 


*q tcjggc 1 1-7 


cc?:ctgcq Dt 


acgccggacg 


cctqcgaci: y 




>:yL; t ;i-|-.i9t:c:ij 


o< i = i: 13 1^ 1 . y c: K 


pq q r:hq t ; 


t nngqncsg t 


.-.gntgncctt 


ct caggeget 




qcact tggct 


g.- 1 cq tctgct 


t_q gct_a i?d Q-7 


rtLit.qcLil i.-r. g 


■■>:jtr:s:: 1. ql. 1 n 


-qt't qr^rjCCt g 


54 0 


ccgcccgagc: 


ct yi^gcqt co 


gcgcttcgc.3 


gccttcac:g 


" Cd cget: ltci 




goo 


ttcgtgctgc 


cqctqgcgcjt 


get ctg net c 


acctcgctcc 


aggtgceccg 


ggtgg caege 


t.£G 


•?qo tr.n pc~-' 


oxjO.jun tijno 


c:«c:c:4t. t:fH,-.\. 


li tneingtHcgc 


t c-ncgctqct 


cciccga cct 17 


720 




tgcq-gcagcg 


ctgeer catc 


h y iii^ l: 


qgL:qt-L-:qr:L:.^ 


i;;t:L!-: : 13 r : [/:.;•> or: 


7£D 


sggaaqa t tg 


gcatt gctst 


tqc^accttc 


ctcatctgct 






a 4 c- 
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aggDtggosg aqctcgtqcc cttcgtcacc gtgascgccc agtggggcat cctcagta^ 300 

tgwtgacct aca^aaggc g;gtggccg.ac Dcgttcacgt actctctgct ccgc.-gy c.-g ?60 

tt'.^iTcc^^n tcctggcogs cat gptgcac eggctgctga agagaacccc gcgcccagca lu£o 

hcv^v-^tn Eicaf?ctCtOt l^'j^o 9 «u : n t ggtqc accagctgct gaagaqaacc 103 0 

Cicgcqccc^ iryt.f:ca^a fiAac^cLtt. gt; r>:iq nc?*atgottc ctgcctgcag 11-60 

ca^cacatt gaggcicctgs ragggctcat cgc-ucooa.:.: : it cL*e<jn HB3 



<2 10> IBb 
^211> 3C3 
-.21.::- PBT 
<21's- K . Sop-ienS 

Met Gly Fro Gly Glu Ala Leu Leu Ala Gly Leu L*u> v*l MoL V«L Len 
t 5 10 15 

<Qa Val Ala Leu Leu Ser Asn Ala Leu Val Leu Leu Cya Cy* Ala Tyr 
20 25 30 

Al □ Cli? Ley Am Thr Arq Ala Sor Gly Vol Lou Lou Val Aan Leu 

Leu GLy his Leu Leu Leu Ala Ala Leu Asp M*f. Pn> Phr Thr Leu 

50 55 

Leu Glv Val Met flrq Gly Arg Thr Pro Sex Ala I'jtd Gly Ala Cys Girj 
G5 " '70 75 80 

V*l 31* Glv Fh* Leu Asp Thr Fhe Leu- Aln S*r A^n Ala Ala Leu £jer 
e .c 90 35 



il A.I.* AT 



a Lku !?er Ala 51" Trp T.*u V«l Glv Fhc Pro leu 



1GS 



1 1 0 



Ai-q Tyx Ala Gly Arg Leu Arq Pro Arg Tyr Ala Gly i_*u Leu Lev. Gly 
115 120 125 

Phe S-ar Gly Ala Ala Lou Gly Oya 
14 0 



Ala Txp Gly Gin Ser Leu Alt 
130 135 



S<ir Trr> Gly T Y r S«r Sr.r Al* Phr. Ala Scr Cys Scr Leu Arg Lou 

i j s ' 155 ^ 

P_rv, Pro Glu Pru Glu flic: Pro Arci Plie 7\±* Ala P5-.~ Thr Al* 7hr l.rsy 

If.E. " 1?D 

K±b J la Val Gly Phe Val Leu Pro Leu Ma Val Leu Gya Lou Thr ter 

160 IBS- 190 

Gin "fll Hi* Atrq v.* 1 A].a Arg Art> Hi.?. Cy* GJt. ftry M«t ;V=F> Thr 
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v^i Thr Mat Lys jiio Len Ala Leu Lku Al,? Asp iah.i Hi* Pro Eer val 
23 0 215 220 

Arc Gin Arc; Cys Leu lie Gin Gin Lys Arc; Arg Arg His Arg Ala Thr 
22S 2 3 0 23E 2 4 0 

Arg Lya lie Giy lie. Ala lie Ale Thr fhe Leu He Cys Phe Ala Pxo 
24 5 2 5C> 25 5 

Tvr Val Met Thr Arq L*u Al* Glu Leu Vnl Pre- Phe Val Thr VaL Asn 
2S0 " 265 2"?0 

Alrt Gin Txp Gly Il« p&r Lys Cv* Thr Tyr 5t^r T..y* r\la V fl 1 

27s 

aj.S A$p Pro Phft Thr Tyr S&r Lt:u Leu Arg Arg Fx-d Fhe Arg Gin vsl 

L* ll Ala ij L y Met Val His Arq Leu Leu Lys Arg Thr Pid Arg Pre- Ala 
310 " 315 320 

Ser Thr Kis Asp St^r S^r Luu Asp Val Ala GVy Met Val H->$ Gin Lou 
:}5>5 330 :V_-i* 

Lhh Lvs Anj Thr Pro Arg Pro Ala Eer Thr Hie Aen Gly Val 

Thr Glu flan Aap Ser Cys Leu -Gin Gin Thr His 
3 b [> 3 t"*C« 

<210:- 1S7 

<21l; ^S" 

<;h:- dna 

<:^!"l- : '> 7irt.il tKial ^Kqutnce 

<2£0> 

< 2 2 1 > ml b c_f eature 
<22 2> Novel Sequence 

<A00-> 1ST 

g cat an get t gouat .ijrjyfcc: £Cq <j 3 pg 

<21('- 1 • H 

1 2* 

<212> DNA 

<213> Artificial Sequence 
<220> 

<223*- Novel Sequel 

< -0 0 {} ■- 1. 1.1 H 

i^i^L.t.nt.a^rt ectcagtgtg tetgetge AS 
<2iU> 189 
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1 



PtlVUiiClO/JlMH 



< U 1 3 > ftrt 1 f i ci a i £ eg j ence 
<221-- mi £c_t y^it ".:r^ 



<213'> Artificial SnquEP^r; 




T4 1. '■■/<:>]. />^<_;Ljej"i fie* 



<^ii|j k , leg 



